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Foreword 


The fundamental constant 7 has played an indispensable role in mathematics, sci- 
ence, and most world cultures for over 4000 years. One would like to calculate 
the circumference of a circle without actually physically measuring it. The ancient 
Babylonians and Hebrews recognized that it was considerably easier to determine 
the radius of a circle than its circumference, and so the more physically demand- 
ing task of calculating the circumference could be replaced by simply multiplying 
the radius by 3. However, before 2000 BCE, the Babylonians replaced this ini- 
tial approximation of 3 for 7 by 3g. On the Egyptian Rhind Papyrus from about 
1650 BCE, we find the slightly better approximation 4(8/9)? = 3.16049.... We 
have come a long way since the heydays of the Babylonian, Hebrew, and Egyptian 
cultures. With the calculation of “houkouonchi” and Alexander J. Yee, we now 
“know” 13.3 trillion digits of 7. It might be pointed out that 7 was not always 
denoted by 7; William Jones first used the notation 7 in 1706. Euler adopted the 
symbol by 1737, and since that time, the notation 7 has been universally used. 

The value 7 arises in many contexts in mathematics in surprising ways. One 
of these is the Buffon needle problem, which asks for the probability that a needle 
of length @ will land on one of the vertical, equally spaced parallel lines on a floor, 
where the distance between each two adjacent lines is equal to d. If d = @, this 
probability is 2/7. Any calculus student will testify that the value of 7 often arises 
in the calculation of a definite integral or of an infinite series in closed form. The 
great Indian mathematician, Srinivasa Ramanujan, evaluated a plethora of integrals 
and infinite series in closed form, and he also found many beautiful series identities. 
Chapter 14 in his second notebook contains 87 such results, and of these series and 
integral evaluations in closed form and series identities, I counted 70 of them in 
which 7 appears. 

Unusual or surprising formulas for 7 are a source of delight for many of us. 
When we calculate the simple continued fraction 


1 1 1 1 
7+ 15+ 1+ 293 + eee’ 


of 7, we are perhaps surprised by the “large” denominator 293. This indicates that 


if we truncate the continued fraction just prior to this large denominator, we should 
obtain a good approximation to 7. Indeed, the associated approximation ae gives 


the first six digits of 7. On the other hand, examining the simple continued fraction 


lez ois fst 7 1 
i 2+2+23+1+4 16539 + cece’ 


for ++, we find the “huge” denominator 16539. This leads to the approximations 


mT=3+ 


rr | 97.5 = 97.40909090909...., a | 3.14159265262..., 


while 
m* = 97.409091034002..., a =3.14159265358.... 


x FOREWORD 


Infinite series, perhaps beginning with the familiar Madhava—Gregory series for 
Arctan x, have played important roles in calculating the digits of 7. One of the 
most beautiful series representations for 7 is given by 


9801 _ 25 (4n)! a8 + 26390n) 
)4(396)4" 


(1) 
n=0 
stated without proof along with 16 further such series for 1/7 by Ramanujan in 
his famous paper, Modular equations and approximations to 7. In November, 1985, 
R. William Gosper, Jr., employed (1) to calculate 17,526,100 digits of 7, which was 
a world record at that time. Each term of (1) gives about 8 digits of 7 per term. 
Perhaps we should have put quotation marks about world record, because at that 
time no proof of (1) had ever been given. We now have proofs of (1), Ramanujan’s 
16 further series formulas for 1/7, and many similar series representations for 1/7, 
in particular, one due to David and Gregory Chudnovsky that yields 14 digits of 
am per term. Papers numbered 10, 11, and 19 in this volume provide readers with 
more details about these formulas and their use for computing digits of 7. 
There are many further intriguing formulas for 7. One of the most popular is 
John Wallis’s formula (1655) 
22446 6. 
133557° 
My favorite is Lord Brouncker’s (1620-1684) continued fraction for 7, 
4 2 eR 
T= ' (2) 
Ll+-2+72 4+ 2 + 2 +00 
Brouncker was a student of Wallis, and mathematicians have speculated for many 
years on how Brouncker might have derived his formula (2). It is actually a special 
case of a more general continued fraction 


TG@tnt+)DTg@ nti) 4 nn? 1? nn? 3? n2_ 5? 
TG(c@+n+ 3) G(x n+3)) 2 2a Qa Qx eco 
which can be found as Entry 25 in Chapter 12 of Ramanujan’s second notebook. If 
we set « = 1 and n = 0 in (3), we obtain (2). 
Seemingly strange approximations to 7 can be obtained from class invariants. 


Let n be a positive integer, and set g = e~"V”. The class invariant G,, is defined 
by 


Ga = 2-49-41 + g)(1 4 9?)(1 + ¢°) eee. (4) 
Class invariants are algebraic numbers. Thus, taking the logarithm of both sides 
of (4) for a certain class invariant should produce an approximate formula for 7. 
For example, on page 300 in his second notebook, Ramanujan considered the cubic 
equation 
223 4a* +62 1=0, 
which has the real root 
xp = 0.188018406073701181981836762895 .... 
Note that the first 16 digits of 
24 log xo 
163 
agree with the first 16 digits of 


Tm = 3.141592653589793238 .... 


= 3.141592653589793245... 


FOREWORD xi 


It may come as a surprise to readers that the resurgence in calculating the 
digits of 7 arose from an adaptation of the arithmetic-geometric mean of Gauss by 
(individually) Eugene Salamin and Richard Brent. Readers can learn about these 
developments in the first five papers of this volume. 

Although we know a lot of digits of 7, there is much that we do not know about 
these digits. For example, we would conjecture that the average digit should be 
about 4.5, but we do not know how to prove this. More precisely, it is conjectured 
that 7 is normal, but evidently we are far from proving this as well. This volume 
contains papers by Stan Wagon (no. 6); by the present authors (no. 17); by Francisco 
Aragon Artacho, the present authors, and Peter Borwein (no. 21); and by the 
present authors (no. 23) bringing readers up to date on what we know about this 
famous conjecture. 

We have provided only a small sample of examples to illustrate the beauty 
and mystery of this remarkable number. Mathematicians and computer scientists 
continue to delve into the mysteries of 7. Our goal in the present volume is to bring 
together some of these investigations and thoughts about 7 from papers published 
during the past half century. Many of these papers can be read by a broad audience. 
For readers who want to read further, most of the 25 selected papers point to more 
technical articles. 


Bruce C. Berndt 


Preface 


This volume is a companion to Pi: A Source Book (by Lennart Berggren, Jonathan 
Borwein, and Peter Borwein, Springer-Verlag), which was first published in 1997, 
with a third edition released in 2004. 

Rather than produce an even heftier fourth edition, the current authors have 
prepared a collection of papers written between 1975 and the present. Since a 
number of the collected papers contain substantial historical material, the reader 
can glean an accurate picture of the life of 7 from the current volume. That said, 
the focus in this book is on “a in the digital age.” The reader will note that many of 
the papers have substantial algorithmic material and it is recommended that where 
possible he or she explore such material at the computer. 

Each of the 25 papers comprising this volume is preceded by a brief summary 
of its contents, and this is accompanied by a very brief, key word, indication of 
some of the ways the content of the given paper relates to that of others in the 
collection. This information is also recorded in Table 1 below. 

For the most part, however, we are happy to let the papers speak for themselves. 
The present authors have been fascinated by 7 throughout their academic lives and 
hope that this volume will help readers share this fascination and potentially even 
contribute to the ever-growing literature on the subject. The associated webpage 
for the collection is accessible through http: //www.springer.com/us/book/ 
9783319323756. 

We are also delighted that Bruce Berndt agreed to add a Foreword to the 
volume. 


Algorithms 1, 2, 4, 5, 8, 10, 11, 13, 14, 16, 17, 22, 23, 24, 25 


Arithmetic-geometric mean Vv 1, 2, 3, 4, 5,8 
Computation v 1, 2, 4, 6, 7, 8, 9, 11, 14, 17, 19, 20, 22, 24, 25 
Curiosities v 12, 21, 22 
Elliptic integrals v 1, 2, 3, 8, 11 
General audience Vv 6, 10, 23, 24 
Graphical representation Vv 21 
History v 3, 4, 8, 10, 11, 17, 19, 20, 22, 24, 25 
Approximations Vv 2,4, 18 
Irrationality Vv 15, 24, 25 
Modular equations v 10, 11 
Normality v 6, 7, 9, 17, 20, 21, 23 
Random walks v 21 
Series v 12, 19, 22 
y 


TABLE 1. Articles by Keyword 
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1. Computation of 7 using arithmetic-geometric 
mean (1976) 


Paper 1: Eugene Salamin, “Computation of pi using arithmetic-geometric mean,” 
Mathematics of Computation, vol. 30, no. 135 (July 1976), pg. 565-570. Reprinted 
by permission of the American Mathematical Society. 


Synopsis: 

In 1976, Eugene Salamin and Richard Brent independently discovered two 
equivalent “quadratically convergent” algorithms for computing 7, meaning that 
each iteration of the algorithm approximately doubles the number of correct digits 
in the result, provided each iteration is performed with a level of numeric precision 
that is desired for the final result. This remarkable co-discovery arguably launched 
the modern computer era of the computation of 7. 

Salamin’s statement of the algorithm could hardly be more concise: 


dagm?(1, V2) 
a 
‘ae Sere aod 


where agm(a, b) is the arithmetic-geometric mean (AGM) iteration: 


(5) 


an bn 
i= feck rc (6) 


bn = 9/Gn—1)n—1 (7) 


with ao and bo as the two input arguments, and c? = a2 — 02. 
However, this is just a special case of his Theorem 1a, which is that 
_ 4agm(1, k)agm(1, k’) 


or co , ? 
ae (Ge 


(8) 


for any k and k’ satisfy k? + kb’? = 1. 

Salamin does not report any computation in his paper. He only says that some 
colleagues are attempting to implement this algorithm on the Illiac IV computer, 
one of the first parallel processors, to 33 million bit (roughly 10 million digit) 
precision. The present authors are not aware of any report of this calculation, if it 
was successful. 


Keywords: Algorithms, Arithmetic-Geometric Mean, Computation, Elliptic Inte- 
grals 
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MATHEMATICS OF COMPUTATION, VOLUME 30, NUMBER 135 
JULY 1976, PAGES 565-570 


Computation of 7 Using Arithmetic-Geometric Mean 


By Eugene Salamin 


Abstract, A new formula for 7 is derived. It is a direct consequence of Gauss’ 
arithmetic-geometric mean, the traditional method for calculating elliptic integrals, 
and of Legendre’s relation for elliptic integrals. The error analysis shows that its 
rapid convergence doubles the number of significant digits after each step. The 
new formula is proposed for use in a numerical computation of 7, but no actual 


computational results are reported here. 


1. Introduction, This paper announces the discovery of a new formula for 7. 

It is based upon the arithmetic-geometric mean, a process whose rapid convergence 
doubles the number of significant digits at each step. The arithmetic-geometric mean, 
together with m as a known quantity, is the basis of Gauss’ method for the calculation 
of elliptic integrals. But with the help of an elliptic integral relation of Legendre, 
Gauss’ method can be turned around to express 7 in terms of the arithmetic-geometric 
mean. The resulting algorithm retains the property of doubling the number of digits 
at each step. 

The proof of the main result (Theorem 1a) from first principles can be conducted 
on the elementary calculus level. The references cited here for the theorems of Landen, 
Gauss and Legendre have been chosen to achieve this goal, thus allowing the widest 
possible reader audience comprehension. 

The formula presented in this paper is proposed as a method for the numerical 
computation of 7. It has not yet been tested on a calculation of nontrivial length, 
although such a calculation is currently in progress [2]. 


2. The Arithmetic-Geometric Mean. Let a9, bg, Co be positive numbers satisfy- 
ing aa = b3 + te Define a, the sequence of arithmetic means, and 5, the sequence 
of geometric means, by 


Oy =AGpy t+ By), By = Opp Ona)” 


nt 


Also, define a positive sequence c¢,: 


ae eer ey 
Ch =a, te 


Two relations easily follow from these definitions. 


q) on, = 2G, 4 7 b,,-1)- 


(2) = 4Ont1ensi: 
Received May 27, 1975; revised November 3, 1975. 
AMS (MOS) subject classifications (1970). Primary 10A30, 10A40, 33A25; Secondary 41A25, 
Key words and phrases. 1, arithmetic-geometric mean, elliptic integral, Landen’s transforma- 
tion, Legendre’s relation, fast Fourier transform multiplication. 
Copyright © 1976, American Mathematical Society 
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The arithmetic-geometric mean is the common limit 
agm (ap, by) = lim a, = lim b,. 


Because of the rapidity of convergence of the arithmetic-geometric mean, as exhibited 
by Eq. (2), the formula to be derived should be regarded as a plausible candidate for 
the numerical computation of 7. 


3. Elliptic Integrals. The complete elliptic integrals are the functions 
2 ; a 
K(k) = 6 qd - 2 sin? 4-4 dt, E(k) =fj a- 2 sin? 4)” dt. 


Also, if k? +k’? = 1, then K’(k) = K(k’) and E(k) = E(k’) are two more elliptic 
integrals. 
There is also a symmetric form of these integrals: 


2 
I(a, b) =f" (a? cos*t + b? sin?1)-” dt, 
m/2 
J(a, b) = fo (a? cos*t + b? sin?2)* dt. 


It is clear that 
Ka, b) =a'K'(bla),  J(a, b) = aE'(b/a). 
4. Landen’s Transformation and the Computation of Elliptic Integrals. Using 


the notation developed in Section 2 of this paper, these transformations are [6, 
Section 25.15], 


(3) IG@,, b,) I@i4y Daas 

(4) I@,; b,,) — 2 Gn + 1» bn+i) a GP Ln + 1) Dati) 
From Eq. (3) it follows that 

(5) I(@p, bo) = n/2agm(@p, bg), 


and, after a little work, Eq. (4) yields 


(6) Jag, by) = (< -% > 24) I(Gg, Bg). 
j=0 


For ay = 1, by =X’, the integrals in Eqs. (S) and (6) are equal to K(k) and E(k), 
respectively, while for ag = 1, by = k, they equal K’(k) and E'(k). This is the well-known 
method of Gauss for the numerical calculation of elliptic integrals [5, pp. 7880], [1, 
Section 17.6]. 

5. Legendre’s Relation. This relation is [4, Article 171], [1, Eq. 17.3.13], 

(7) K(K)E'(k) + K'(K)E(K) — K()K'(k) = 2/2. 
Equivalently, 
(8) a7I(a, b)J(a’, bY) + a’? Ha’, b')J(a, b) — a?a'2 Ka, b) a’, b') = (n/2)aa', 


where a, b, a’, b’ are subject to the restriction (b/a)? + (b'/a')? = 1. 
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6. New Expression for 7, Take ay = a) = 1, by =k, by =k’. As in Section 2, 
define the sequences a, b,,, C,, Bes bi, Ce In Eq. (8) eliminate the J integrals by use 
of (6), and then eliminate the J integrals by use of (5). Lo and behold, the resulting 
equation can be solved for 7! 

THEOREM la. 

4agm(1, k)agm(1, k’) 
@) _ 1 -> We? tery 
f#=1 I Fi 

The j = 0 term in the summation has been eliminated by use of cf + ¢? = 
k’? + k? = 1. It is best to compute c; from Eq. (1). 

Theorem la is a one-dimensional continuum of formulae for 7. This provides 
for an elegant and simple computational check. For example, 7 could be calculated 
starting with k = k’ = 2—%, and then checked using k = 4/5, k’ = 3/5. The symmetric 
choice, k = k’, causes the two agm sequences to coincide, thus halving the computa- 
tional burden. 

THEOREM 1b. 

_ Aagm(, 27)? 


t= ; 
1 ->” git1,2 
j=l I 


7. Error Analysis. Although Theorem 1a is true for all complex values of k 
(except for a discrete set), the error analysis will assume real k and kK’. Then 0 < 
k, k’ <1, Let n square roots be taken in the process of computing agm = agm(1, &), 
and n’ square roots in computing agm’ = agm(1, k’). Then no further square roots 
are needed to calculate the approximation 

- 44 1%n' 41 
10 nn eA he WA! ea 
(10) LS a = Ete el? 

A rough estimate shows that a,,, differs from agm by c, ,, and that the 
finite sum differs from the infinite sum by aa 42° Thus, the numerator and 
denominator in (10) have been truncated for compatible error contributions, and the 
denominator error is dominant. 

To obtain rigorous error bounds, introduce the auxiliary quantity 7,,,,1 whose 
denominator is taken from (10), but whose numerator is taken from (9). The first 


step is to establish the existence of e¢,,,1, €,,," Such that 


qi) 0<t- Tay? <pn's 


O< tan’ > Tan! <n 


(12) 


@ nat < enn’: 
These three inequalities imply that |w -a,,+|<e,,'. 
The left-hand inequalities in (11) and (12) are obvious. From the general inequal- 


ity (1/x) — (1/@ + »)) < y/x?, valid for positive x and y, it follows that 
2 


ae if kad . = - 
TW ng! <r ig? + ig?) | 
wi Taree (E26 + §24 
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This establishes (11), with error bound 


2 oo oo 
7 , . 
13 = Mae, + Qa'c', 
(13) 2° Sat (E, ogy + w) 
Proceeding to the next inequality, we first get 
1 


? , 
7 Gy. 14 y'4 1 7 ABM agm ). 
agm agm’ ~ @ tion +l 


Tent Fan’ © 


Substitute a,,., =agm +s, a,+,, = agm’ + s’, where 
ate : , 
s= > G, v= DB cd, 
j n+2 ni+Z 
and use agm <1, agm < 1 to get 


Trant ~ Mant <(s +8" + ss')/agm agm’. 


Also, since s <1, s’ <1, it follows that ss’ < (s + s')/2. Thus, inequality (12) is 
established with error bound 


3 T oo oo 
@e=- oo ce, + c; }. 
(14) enn’ ~ 5 agm agm’ es i 2 i) 
Finally, a term-by-term comparison of (13) and (14), using 2a, > 1 and aw > 3, shows 


that ean < Cnn’: 
At this point, a needed inequality is derived. 


a, <a, +b, = 2aj41> 
ay aa a 
2ajCp¢ 1 S4Qj4 G41 = = Gi — ae 


(15) ajc; + 2ej44) <4j_1G- 


Consider the first summation in (13), but with the upper limit o¢ replaced by 
finite VW. Perform the following sequence of operations, each of which increases the 
sum. First, replace ay by ay_,. Next, repeatedly apply (15) to the pair of highest- 
indexed terms in the sum. At the end, we are left with the single term 2"*7a,, 1¢,45 
< De ee: which is thus an upper bound for the initial summation. Since NV was 
arbitrary, the infinite summation also has this upper bound. Therefore, 

2 , 
(16) enn! < oa aan (2% naa + 2” Cytga)- 

An upper bound for ¢,,,, is needed now. It is convenient to use the abbrevia- 
tions 

X, =loge,, 8, = log(4a,). 


Equation (2) gives x, as the solution to an inhomogeneous linear difference equation. 


n > 
x, = 2" (x = > rin) 
IF 
By rearrangement, 
n-1 nA 
Xp 2" (xy ey + 22, -53.1) Ey. 
1 


j= 
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Using g; — )41 > 9, 8, <log 4, and x9 —g, = (1/2) log(c, /4a,), we get 


a7) Mie ml S 27* MNog(ajla;4. 1) — loti) + log 4. 
j=1 A 

For the purpose of an error analysis, the expression within brackets could be calculated 

numerically for any case of interest. However, it can be evaluated in closed form {7, 

p. 14] and is equal to — 7K'(k)/K(k) = - magm/agm’. Then 


x, <— m(agm/agm’)2"~! + log 4. 


Substituting this into (16) yields the final result. 
THEOREM 2a. 
81? a ’ agm’ ys 
la-a jo 2exp(—1 = ant \4 27 exp Ce or : 
nn agm agm agm agm 
In the symmetric case, with 1, = 7 
THEOREM 2b. 


nn> Theorem 2a simplifies to 


lIn-a,I1< (122"*4 Jasm?)exp(— 12"* 1), 


The number of valid decimal places is then 
THEOREM 2c. 


—log, 9 | — 1, | > (nflog 10)2”*? — nlog, 92 — 210g, 9(4a/agm). 


8. Numerical Computation. Raphael Finkel, Leo Guibas and Charles Simonyi 
are currently engaged in calculating 7 using the method proposed in this paper [2]. 
The operations of multiprecision division and square root are reduced to multiplication 
using a Newton’s method iteration, The multiplications are then performed by the 
Schénhage-Strassen fast Fourier transform algorithm [10], [8, p. 274]. The computa- 
tion, to be run on the Illiac IV computer, is expected to yield 33 million bits of 7 in 
an estimated run time of four hours. This run time is determined by disc input-output, 
and the actual computation is estimated to be only a couple of minutes. Alas, they do 
not plan to convert to decimal. 


9. Concluding Remarks. The main result of this paper, Theorem la, directly 
follows from Gauss’ method for calculating elliptic integrals, Eqs. (5) and (6), which 
was known in 1818 [3, pp. 352, 360], and from Legendre’s elliptic integral relation, 
Eq. (7), which was known in 1811 [9, p. 61]. It is quite surprising that such an 
easily derived formula for 7 has apparently been overlooked for 155 years. The author 
made his discovery in December of 1973. 

The series summation which was used to simplify Eq. (17) was also discovered 
by Gauss [3, p. 377]. An interesting consequence of this result of Gauss is that e7 
can be expressed as a rapidly convergent infinite product. If ag = 1, by = 2%, then 


66 > ~j+1 
e” = 32 J] (“e)’ ; 


Charles Stark Draper Laboratory 
Cambridge, Massachusetts 02139 
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Synopsis: 

As mentioned in the preface to Paper 1, Eugene Salamin and Richard Brent 
independently discovered two equivalent “quadratically convergent” algorithms for 
computing 7 in 1976, and this remarkable co-discovery arguably launched the mod- 
ern computer era of the computation of 7. 

Brent’s paper includes an equivalent statement of the Salamin’s 7 algorithm, 
but he goes further in showing how the same mathematical structure yields algo- 
rithms for evaluating all of the “elementary” algorithms, including exp, log, arctan, 
sin, cosh and others. 

Brent also included a detailed analysis of the computational complexity of these 
algorithms, showing how the functions above can be evaluated, with relative error 
O(2-"), in O(M(n) log(n)) operations, where M(n) is the cost of a multiplication. 
He then adds that in the wake of the Schonhage-Strassen result, namely that a mul- 
tiplication can be done with a fast Fourier transform, that this means all the elemen- 
tary functions can be evaluated to O(2~”) relative error in O(n log?(n) log log(n)) 
operations. 
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abstract. Let f(x) be one of the usual elementary functions (exp, log, artan, sin, cosh, ete.), and 
let Mf (n) be the number of single-precision operations required to multiply »-bit integers. It. is shown 
that f(z) can be evaluated, with relative error O(2~), in O(M (n)log (n)) operations as xn — ~, for 
any floating-point number 2 (with an »-bit fraction) in 4 suitable finite interval. From the Schonhage- 
Strassen bound on M(n), it follows that an n-bit approximation to f(z) may be evaluated 
in Of log?{n) log log{n)) operations. Special eases inelude the evaluation of constants such as 
x, ¢, and e*, The algorithms depend on the theory of elluptic integrals, using the arithmetic-geometric 
mean iteration and ascending Landen transformations. 


KEY WORDS AND PHRASES: multiple-precision arithmetic, analytie complexity, arithmetic-geometrie 
mnean, computational complexity, elementary function, elliptic integral, evaluation of x, exponential, 
Landen transformation, logarithm, trigonometric function 


OR CATEGORIES: 5.12, 5.15, 5.25 


1. Introduction 


We consider the number of operations required to evaluate the elementary functions 
exp(z), log(x),’ artan(z), sin(x), etc., with relative crror O(2-"), for x in some 
interval [a, 5], and large n. Here, (a, 6] is a fixed, nontrivial interval on which the relevant 
elementary function is defined. The results hold for computations performed on a multi- 
tape Turing machine, but to simplify the exposition we assume that a standard serial 
computer with 4 random-aeccess memory is used. 

Let M(x) be the number of operations required to multiply two integers in the range 
{0, 2'*'), We assume the number representation is such that addition can be performed in 
O(M(n)) operations, and that 14(7) satisfies the weak regularity condition 


Man) < BM(n), (1.1) 


for some a and 8 in (0, 1), and all sufficiently large n. Similar, but stronger, conditions 
are usually assumed, either explicitly [11] or implicitly {15}. Our assumptions are cer- 
tainly valid if the Schénhage-Strassen method [15, 19] is used to multiply n-bit integers 
(in the usual binary representation) in O(n log(n) log log(”)) operations. 

The elementary function evaluations may be performed entirely in fixed point, using 
integer arithmetic and some implicit scaling scheme. However, it is more convenient to 
assume that floating-point computation is used. For example, a sign and magnitude 
representation could be used, with a fixed length binary exponent and an n-bit binary 
fraction. Our results are independent of the particular floating-point number system 
used, so long as the following conditions are satisfied. 

Copyright © 1976, Association for Computing Machinery, Inc. General permission to republish, 
but not for profit, all or part of this material is granted provided that ACM’s copyright notice is 


given and that reference is made to the publication, to its date of issue, and to the fact that reprinting 
privileges were granted by permission of the Association for Computing Machinery. 
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1 Log(z) denotes the natural logarithm. 


Journal of tho Association for Computing Machinery, Vol. 23, No. 2, April 1976, pp. 242-251. 


11 


12 2. FAST MULTIPLE-PRECISION EVALUATION OF ELEMENTARY FUNCTIONS (1976) 


Fast Multiple-Precision Evaluation of Elementary Functions 243 


1. Real numbers which are not too large or small can be approximated by floating- 
poimt numbers, with a relative error O(2"*). 

2. Floating-point addition and multiplication can be performed in O(Af(n)}) opera- 
tions, with a relative error O(2"") in the result. 

3. The precision ” is variable, and a floating-point number with precision n may be 
approximated, with relative error O(2™) and in O(M(n)) operations, by a floating- 
point number with precision m, for any positive m <n. 

Throughout this paper, a floating-point number means a number in some representation 
satisfying conditions 1 to 3 above, not a single-precision number. We say that an opera- 
tion is performed with preciszon n if the result is obtained with a relative error O(2°"). 
It is assumed that the operands and result are approximated by floating-point numbers. 

The main result of this paper, established in Sections 6 and 7, is that all the usual 
elementary functions may be evaluated, with precision », in O( M(n) log(n)) operations. 
Note that O(M(n)n) operations are required if the Taylor series for log(1 + 2) is 
summed in the obvious way. Our result improves the bound O(M(n) log’(n)) given in 
[4], although the algorithms described there may be faster for small n. 

Preliminary results are given in Sections 2 to 5. In Section 2 we give, for completeness, 
the known result that division and extraction of square roots to precision » require 
O(M(n)) operations. Section 3 deals briefly with methods for approximating simple 
zeros of nonlinear equations to precision n, and some results from the theory of elliptic 
integrals are summarized in Section 4. Since our algorithms for elementary functions 
require a knowledge of 7 to precision n, we show, in Section 5, how this may be obtained 
in O(M(n) log(n)) operations. An amusing consequence of the results of Section 6 is 
that e” may also be evaluated, to precision n, in O( M(n) log(n)) operations. 

From (4, Th. 5.1], at least O(4f(n)) operations are required to evaluate exp(x) or 
sin(z) to precision n. It is plausible to conjecture that O(Af(n) log(n)) operations are 
necessary. 

Most of this paper is concerned with order of magnitude results, and multiplicative 
constants are ignored. In Section 8, though, we give upper bounds on the constants. 
From these bounds it is possible to estimate how large 7 needs to be before our algorithms 
are faster than the conventional ones. 

After this paper was submitted for publication, Bill Gosper drew my attention to 
Salamin’s paper [18], where an algorithm very similar to our algorithm for evaluating a 
is described. A fast algorithm for evaluating log(z) was also found independently by 
Salamin (sce [2 or 5]). 

Apparently similar algorithms for evaluating elementary functions are given by 
Borchardt [3], Carlson {8, 9], and Thacher {23]. However, these algorithms require 
O(M(n)n) or O(M(n)n') operations, so our algorithms are asymptotically faster. 

We know how to evaluate certain other constants and functions almost as fast as 
elementary functions. For example, Euler’s constant y = 0.5772 ... can be evaluated 
with O(M(n) log’ 2) operations, using Sweeney’s method [22] combined with binary 
splitting (4). Similarly for '(@), where a is rational (or even algebraic): see Brent. [7]. 
Related results are given by Gosper [13] and Schroeppel [20]. It is not known whether 
any of these upper bounds are asymptotically the best possible. 


2. Reetprocals and Square Roots 


In this section we show that reciprocals and square roots of floating-point numbers may 
be evaluated, to precision », in O(M(n)) operations. To simplify the statement of the 
following lemma, we assume that M(x) = 0 for all z < 1. 

Lemma 2,1. If-y € (0,1), then Dino M(y’n) = O(M(n)) asn— &, 

Proor. If wand Bare asin (1.1), there exists k such that-y* < a. Thus, Di M(y'n) 
< kD M(a’n) < kM(n)/(1 — 8) + O(1), by repeated application of (1.1). Since 
Min) > © asn— o@, the result follows. 
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In the following lemma, we assume that 1/c is in the allowable range for floating-point 
numbers. Similar assumptions are implicit below. 

Lumma 2.2. If c ts a nonzero floating-point number, then 1/c can be evaluated, to pre- 
cision n, in O( M(n)) operations. 

Proor. The Newton iteration 


Liqa = 2.(2 ~ e2;) (2.1) 


converges to 1/e with order 2. In faet, if x, = (1 — ¢,}/c, substitution in (2.1) gives 
é41 = €,. Thus, assuming || < 3, we have je,| < 2” for all ¢ > 0, and 2; is a suffi- 
ciently good approximation to 1/c if k > loge 2. This assumes that (2.1) is satisfied 
exactly, but it is easy to show that it is sufficient to use precision n at the last iteration 
(2 = k — 1), precision slightly greater than x/2 for 7 = k ~ 2, ete. (Details, and more 
efficient methods, are given in [4, 6].) Thus the result follows from Lemma 2.1. Since 
a/y = x(1/y), it is clear that floating-point division may also be done in O(M(n)) 
operations. 

Lemma 2.3. If ec > 0%s @ floating-point number, then c! can be evaluated, to precision n, 
in O(M(2)) operations. ; 

Proor. Ife = 0 then c! = 0. If c + 0, the proof is similar to that of Lemma 2.2, 
using the Newton iteration 2.41 = (2, + ¢/2z,)/2. 

Lemma 2.4. Forany ficedk > 0, M(kn) = O(M(n)) asn— =. 

Proor. Since we can add integers less than 2” in O(M(7n)) operations, we can add 
integers Jess than 2°” in O(kM(n)) = O(M(n)) operations. The multiplication of 
integers less than 2°" can be split into O(%*) multiplications of integers less than 2°, 
and O(k’) additions, so it ean be done in O(k"M(n)) = O(M(n)) operations. 


3. Solution of Nonlinear Equations 


In Section 6 we need to solve nonlinear equations to precision n. The following lemma is 
sufficient for this application. Stronger results are given in [4, 6]. 

Lemma 3.1. If the equation f(x) = ¢ has a sumple root ¢ x4 0, f' 18 Lipschitz continuous 
near {, and we can evaluate f(z) to precision n in O(M(njo(n)) operations, where o(n) 
is a positive, monotonic wncreasing Junction, for x near §, then ¢ can be evaluated to precision 
n in O(M(n)o(n)) operaizons. 

Proor. Consider the discrete Newton iteration 


tn = & — hPa.) — €)/(F(e + hi) — f(a.)). (3.1) 


Ifh, = 2°77, 2, — ¢ = 0(2°””), and the right side of (3.1) is evaluated with precision 
n, then a standard analysis shows that 2.41 — ¢ = O(2”"). Since a sufficiently good 
starting approximation z¢ may be found in O(1) operations, the result follows in the 
same way as in the proof of Lemma 2.2, using the fact that Lemma 2.1 holds with 4f(7) 
replaced by M(n)é(n). The assumption {+0 is only necessary because we want to 
obtain ¢ with a relative (not absolute) error O(2°”). 

Other methods, e.g. the secant method, may also be used if the precision is increased 
appropriately at each iteration. In our applications there is no difficulty in finding a 
suitable initial approximation 2» (see Section 6). 


4. Results on Elliptic Integrals 

Tn this section we summarize some classical results from elliptic integral theory. Most of 
the results may be found in [1], so proofs are omitted. Elliptic integrals of the first and 
second kind are defined by 


¥ 
F(y,a) = (1 — ginta sin’) *d0 (4.1) 


. 
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and 
¥ 
Bly, @) = i (1 — sinter sin6)!08, (4.2) 


respectively. For our purposcs we may assume that a and y are in {0, 7/2]. The complete 
elliptic integrals, F(w/2, a) and E(#/2, «), are simply written as F(a) and E(a), 
respectively. 
Legendre’s Relation. We need the identity of Legendre [17]: 
E(a)F(a2/2 — a) + E(r/2 — a) F(a) — F(a) P(a/2 — a) = 2/2, (4.8) 


and, in particular, the special case 


QE( x/4)F( 1/4) — (F(t/4))? = 2/2. (44) 
Small Angle Approximation. From (4.1) it is clear that 
F(¥, @) = ¥ + O(a") (4.5) 
as a0. 
Large Angle Approximation. From (4.1), 
Ply, a) = P(Y, 4/2) + O(4/2 ~ @)’, (4.6) 
uniformly for 0 < ¥ < Yo < 7/2, as a 7/2. Also, we note that 
F(¥, 7/2) = log tan( 7/4 + ¥/2). (4.7) 
Ascending Landen Transformation. Jf0 < a. < angi < 7/2, 0 < far < fs < 7/2, 
sin a, = tan’(a,4:/2), (4.8) 
and 
sin(2¥.41 — ¥.) = sin a, sin y,, (4.9) 
then 
FY, G41) = [(1 + sin a)/2]F(vs , a). (4,10) 
If s, = sin a, and v, = tan(¥./2), then (4.8) gives 
Sinn = 26'/(1 + 8), (4.11) 
and (4.9) gives 
D4 = w,/(1 + (1 + ws"), (4.12) 
where 
ws = tan figs = (0, + we) /(1 — ver), (4.13) 
w, = tan(War — 4/2) = wi/(L + (1 — wr)'), (4.14) 
and 
w1 = sin(Qhiu — ¥,) = 23u./(1 + 2). (4.15) 


Arithmetic-Geometric Mean Iteration. From the ascending Landen transformation 
it is possible to derive the arithmetic-geometric mean iteration of Gauss [12] and La- 
grange [16]: if a = 1, & = cosa > 0, 


O41 = (a, + 8,)/2, (4.16) 
and 
bier = (a,b,)}, (4.17) 
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then 
lim a, = 1/[2F(a)]. (4.18) 
Also, if ¢ = sin a@ and 
Ca. = 4, — Ga, (4.19) 
then 
i) 
E(a)/Fla) = 1~ 32°76, (4.20) 
1=0 


An Infinite Product. Let s,, a,, and b, be as above, with a = 7/2 — ay, 80 % = 
bo/ao. From (4.11), (4.16), and (4.17), it follows that s, = 6,/a, for all 2 > 0, Thus, 
(1 + s,)/2 = a.yi/a., and 

Ls 

0 [1 + 3.)/2] = hm a = 2/(2F (2/2 — ay)] (4.21) 

1=0 1-0 
follows from (4.18). (Another connection between (4.11) and the arithmetie-geometric 
mean iteration is evident if s9 = (1 — by’'/ao’)*. Assuming (4.11) holds for 1 < 0, it 
follows that s., = (1 — b’/a.)? for all 7 > 0. This may be used to deduce (4.18) from 
(4.10).) 


5. Hvaluation of 3 


Let @ = 1, & = & = 7, A = hm... @,, and 7 = lim... é,, where a. , 6, , and ¢, are 
defined by (4.16), (4.17), and (4.19) for? > 1, and t, = 4 — Dopo 2? 7c,”. From (4.4), 
(4.18), and (4.20), we have 


x= A/T, (5.1) 


Since a, > & > O for allz > 0, and Gui = @. — Qn = Qe ~ 0, (4.17) gives ds 
= fl@aa + Cut) (Qua —~ Gyr)! = Qua — O41), 80 C42 = Ole41). Thus, the process 
converges with order at least 2, and log, n + O(1) iterations suffice to give an error 
0(2-") in the estimate of (5.1). A more detailed analysis shows that a? .4/t, < 7 < a,/t, 
for all 2 > 0, and also a//t, — 7 ~ Smrexp(—2*r) and at — @au/t ~ 
a2'*4 exp( ~2°'a) as i> ©. The speed of convergence is illustrated in Table I. 

From the discussion above, it is clear that the following algorithm, given in pseudo- 
Algol, evaluates 7 to precision 7. 


Algorithm for x 
Ac1,Be24;7—3;X <1, 
while 4 — B> 2" do 
begin Y — A; A ~ (A + B)/2; B— (BY); 
T—T—- X(A — VY); X ~ 2X 
end; 


return A?/T for, better, (A + B)?/(47)]. 


TABLE [. Convercence oF 
APPROXIMATIONS TO € 


1 BR — O74/t af/t,~— x 
0 2.3’-1 8.6'~1 
1 1.0’-3 4.6’—-2 
2 7.4/-§ 8 8-5 
3 1.8’—19 3.1/~10 
4 §.8/—41 3.7'~21 
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Since logan + O(1) iterations are needed, it is necessary to work with precision ++ 
O(log log(n)), even though the algorithm is numerically stable in the conventional 
sense. From Lemmas 2.2-2.4, each iteration requires O(M(n)) operations, so 7 may be 
evaluated to precision n in O( M(x) log(v)) operations. This is asymptotically faster 
than the usual O(n’) methods (14, 21] if a fast multiplication algorithm. is used. A high- 
precision computation of a by a similar algorithm is described in [10]. Note that, because 
the arithmetic-geometric mean iteration is not self-correcting, we cannot obtain a bound 
O(M(n)) in the same way as for the evaluation of reciprocals and square roots by New- 
ton’s method. 


6. Evaluation of exp(x) and log(x) 

Suppose 6 > 0 fixed, and m € [6, 1 — 4]. If sin ay = m!, we may evaluate F( ay) to pre- 
cision n in O(M(n) log(n)) operations, using (4.18) and the arithmetic-geometrie 
mean iteration, as for the special case (2/4) described in Section 5. (When using (4.18) 
we need 7, which may be evaluated as described above.) Applying the ascending Landen 


transformation (4.8)-(4.10) with: = 0,1, ---,k — land yo = 1/2 gives 
kot 
(ds, ax) = {ZU + sina,)/2}} Pleo. (61) 


Since 8) = sin a = m! > 8! > 0, it follows from (4.11) that s, > 1 as i ©. In fact, 
ifs, = 1—e,, then ey = 1 — sy. = 1 — 21 — «)'/(2 — &) = €2/8 + Oe,), 80 
s,—> 1 with order 2. Thus, after k ~ logen iterations we have ¢, = O(2°"), so 7/2 — a 
= 0(2-”*) and, from (4.6) and (4.7), 

F(be , &) = log tan(m/4 + y/2) + O(2"). (6.2) 


Assuming k > 0, the error is uniformly O(2"") for all m € [6,1 — 4], since wy < yi < 1/2. 
Define the functions 


Unie {> (a+ sina,)/2}¢ F(a) (6.3) 


and 
T(m) = tan(7/4 + %o/2), (6.4) 


where ¥.. = lim... ¥.. Since s, — 1 with order 2, the infinite product in (6.3) is con- 
vergent, and U(m) is analytic for all m € (0, 1). Taking the limit in (6.1) and (6.2) 
as n (and hence k) tends to » , we have the fundamental identity 


U(m) = log T(m). (6.5) 


Using (4.11)-(4.15), we ean evaluate U(m) = {[]E3[(1 + s,)/2]}F(a0) + O(2°-") 
and T(m) = (1 + »%)/(1 — 4%) + O(2), to precision x, in O(M(n) log(n)) opera- 
tions, The algorithms are given below in pseudo-Algol. 


Algorithm for U(m) 
A-1,;B<— (1 ~— m)s; 
while A — B > 2*%do 

begin C — (A + B)/2; B — (AB)!; A — C end; 
Ae x/(A + B); Se mi; 
while 1 — § > 27% do 

begin A — A(I + S)/2; S — 2S1/(1 + S) end; 
return A(1 + S)/2. 


Algorithm for T(m) 


Veo1;S<_mi; 
while 1— 8S > 2“ do 
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begin W — 28V/(1 + V4); 
Wew/d + 2 — W)); 
We (V+ W)/0 - VW); 
Ve W/1+ 2+ W)); 
8 + 284/(1 + 8) 
end; 
return (1 + V)/(1 — VY). 


Properties of U(m) and T(m). From (4.21) and (6.3), 
U(m) = (2/2) F(ee)/F( 2/2 — a), (6.6) 


where sin a = m! as before. Both F(a) and F(7/2 — a») may be evaluated by the 
arithmetic-geometric mean iteration, which leads to a slightly more efficient algorithm 
for U(m) than the one above, because the division by (1 + S) in the final “while” 
loop is avoided. From (6.5) and (6.6), we have the special cases U(4) = 2/2 and T(4) 
= e*”, Also, (6.6) gives 


U(m)U(Q1 — m) = 2°/4, (6.7) 


for all m € (0,1). 

Although we shall avoid using values of m near 0 or 1, it is interesting to obtain asymp- 
totic expressions for U(m) and T'(m) as m — 0 or 1. From the algorithm for T(m), 
T(1 — €) = 4€? — A + O(e) as € > O. Thus, from (6.5), U(1 — 6) = Le) — €/4 
+ O(é), where L(e) = log (4/e). Using (6.7), this gives U(e) = 2° /[4AL(€)} + Ofe/L’), 
and hence T(e) = exp(2"/[4L(€)]) + O(¢/L’). Some values of U(m) and T(m) are 
given in Table IT. 

Evaluation of exp(x). To evaluate exp(z) to precision n, we first use identities such 
as exp(27) = (exp(x))? and exp(—x) = 1/exp(x) to reduce the argument to a 
suitable domain, say 1 < x < 2 (see below). We then solve the nonlinear equation 


U(m) = 2, (6.8) 


obtaining m to precision n, by a method such as the one described in Section 3. From 
Lemma 3.1, with ¢(n) = log(n), this may be done in O(M(n)log(n)) operations. 
Finally, we evaluate T(m) to precision », again using O(M(n)log(7)) operations. 
From (6.5) and (6.8), T(m) = exp(z), so we have computed exp(«) to precision 7. 
Any preliminary transformations may now be undone. 

Evaluation of log(x)}. Since we can evaluate exp(z) to precision n in O(M(n) log(n)) 
operations, Lemma 3.1 shows that we can also evaluate log(x) in O(M(n) log(n)) 
operations, by solving the equation exp(y) = x to the desired accuracy. A more direct 
method is to solve T(m) = x (after suitable domain reduction), and then evaluate U(m). 

Further details. If x € [1, 2] then the solution m of (6.8) lies in (0.10, 0.75), and it 
may be verified that the secant method, applied to (6.8), converges if the starting ap- 
proximations are mp = 0.2 and m, = 0.7. If desired, the discrete Newton method or 
some other locally convergent method may be used after a few iterations of the secant 
method have given a good approximation to m. 


TABLE I. Tae Funcrions U(m) anp Tm) 


m U(m) T(m) m Um) Tim) 
0 01 0 6693 1.9529 0.60 1.7228 5.6004 
0.05 0.8593 2.3615 0.70 1.9022 6 6999 
0.10 0.9824 2.6710 0.80 2.1364 8.4688 
0,20 1.1549 3.1738 0.90 2.5115 12.3235 
0.30 1.2972 3.6591 0.95 2.8714 17.6617 
0.40 1.4322 4,1878 0.99 3.6864 39.8997 
0.50 1.5708 4.8105 
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Similarly, if x € [3, 9], the solution of T7(m) = a lies in (0.16, 0.83), and the secant 
method converges if mp = 0.2 and m = 0.8. 
If x = 1 + e where € is small, and for domain reduction the relation 


log(z) = log(Ax} — log(d) (6.9) 


is used, for some d € (3, 9), then log(Az) and log(A) may be evaluated as above, but 
cancellation in (6.9) will cause some loss of precision in the computed value of log(z). If 
je] > 2°”, it is sufficient to evaluate log(Az) and log(\) to precision 2n, for at most n 
bits are lost through cancellation in (6.9). On the other hand, there is no difficulty if 
Jel < 2°”, for then log(1 + €) = «(1 + O(2-")). When evaluating exp(x), similar 
loss of precision never occurs, and it is sufficient to work with precision 2 + O(log log(n)), 
as in the evaluation of r (see Section 5). To summarize, we have proved: 

THrEoREM 6.1, If —« <a<b < &, then O(M(n) log(n)) operations suffice to evatu- 
ate exp(x) to precision n, uniformly for all floating-point numbers x € {a, bl, as n > ~; 
and similarly for log(x) if a > 0. 


7. Evaluation of Trigonometric Functions 


Suppose 6 > 0 fixed, and x € [6, 1]. Let s = sin a = 2°"? and m = tan(fo/2) = 
2/(1 + (1 + 2°)}), so tan w = 2. Applying the ascending Landen transformation, as 
for (6.1), gives 


kak 
F(t, on) = {0 [a+ s)/alt F(Wo, av). (71) 


Also, from (4.5) and the choice of 3), 
F(¥o, @} = artan(z) + O(2"). (7.2) 


From (4.11), 3.41 > s, so there is some 7 < logon + O(1) such that s, € [4, 4]. Since 
8, > 1 with order 2, there is some k < 2 logon + O(1) such that 1 — s, = O(27"). From 
(4.6) and (4,7), F(¥e, a) = log tan(w/4 + ¥/2) + 0(2"). Thus, from (7.1) and 
(7.2), 


k-1 
aati {i 2/1 + 23} oe tan(/4-+ ¥r/2) + O12). (7.8) 


Tf we evaluate tan(1/4 + ;,/2) as above, and use the algorithm of Section 6 to evaluate 
the logarithm in (7.3), we have artan(x) to precision » in O(M(n) log(x)) operations. 
The algorithm may be written as follows. 


Algorithm for artan(z), 2 € {6, 1] 
Sao? Veo a/l + (+ 2"); O- 1; 
while 1 ~ S > 2% do 
begin Q — 20/(1 +S); 
W — 28V/01 + V9; 
We W/( + (1 — W); 
W-(V + W)/G — VW); 
V—W/ + (1 + W*)4); 
S§ — 281/(1 +5) 
end; 


return Q log(Q + V)/1 — V)). 


After k iterations, Q < 2", so at most 2 logyn + O(1) bits of precision are lost because V 
is small. Thus it is sufficient to work with precision n + O (log (7)), and Lemma 2.4 justi- 
fies our elaim that O( M(n) log (n)) operations are sufficient to obtain artan (x) to pre- 
cision 7. 

If x is small, we may use the same idea as that described above for evalu- 
ating log(1 + €): work with precision 3n/2 + O(log(n)) if z > 2°”, and use artan(2) 
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=211+02"))t0<2<27, (Actually, it is not necessary to increase the working 
precision if log((1 + V)/(1 — V)) is evaluated carefully.) 

Using the identity artan(z) = 2/2 — artan(1/z) (« > 0), we can extend the do- 
main to [0, © ). Also, since artan(—z) = —artan(z), there is no difficulty with negative 
az. To summarize, we have proved the following theorem. 

Turorem 7.1. O(Af(n) log(n)) operations suffice to evaluate artan(x) to precision n, 
uniformly for all floating-point numbers x, asn—- %. 

Suppose @ € {5, 7/2 — 4]. From Lemma 3.1 and Theorem 7.1, we can solve the equa- 
tion artan(z) = 6/2 to precision » in O(M(n) log(n)) operations, and thus evaluate 
x = tan(@/2). Now sin @ = 22/(1 + 2°) and cos # = (1 — x°)/(1 +2") may easily be 
evaluated. For arguments outside [6, 7/2 — 6], domain reduction techniques like those 
above may be uscd. Difficulties occur near certain integer multiples of +/2, but these 
may be overcome (at least for the usual floating-point number representations) by in- 
creasing the working precision. We state the following theorem for sin(x), but similar 
results hold for the other trigonometric functions (and also, of course, for the elliptic 
integrals and their inverse functions). 

Turorem 7.2. If la, b] G (—72, 7), then O(M(n) log(n)) operations suffice to evaluate 
sin(x) to precision n, uniformly Jor all floating-point numbers x & (a, bj, asn— @, 


8. Asymptotic Constants 


So far we have been concerned with order of magnitude results. In this section we give 
upper bounds on the constants K such that w(n) < (K + 0(1))M(n) logun, where w(n} 
is the number of operations required to evaluate 7, exp(x), etc., to precision , The fol- 
lowing two assumptions will be made. 

1. For ally > O and ¢« > 0, the inequality M(yn) < (vy + €)M(m) holds for suffi- 
ciently large x. 

2. The number of operations required for floating-point addition, conversion between 
representations of different preeision (at most 2), and multiplication or division of 
floating-point numbers by smail integers is o(Af(n)) asn~— &. 

These assumptions certainly hold if a standard floating-point representation is used and 
M(n) ~ n (log())" (log log(n))® for some a > 0, provided B > Oif a = 0. 

The following result is proved in [4]. The algorithms used are similar to those of See- 
tion 2, but slightly more efficient. 

TreEoreM 8.1. Preciston-n division of floating-point numbers may be performed in 
(4 + o(1))M(n) operations as n > ©, and square roots may be evaluated in (11/2 + 
6(1))M(n) operations. 

Using Theorem 8.1 and algorithms related to those of Sections 5-7, the following re- 
sult is proved in {5}. 

THEOREM 8.2. a may be evaluated to precision n 1m (15/2 + 0(1))M(n) logon opera- 
tionsasn— «©. If 3 and log 2 are precomputed, the elementary functeon f(x) can be evalu- 
ated to precision nin (K + 0(1))M(n) logan operations, where 


We Vee af f(x) = log(x) or exp(z), 
~ \34 if f(z) = arten(x), sin(2), ete., 


and x ts a floating-point number un an interval on which f(x) ts defined and bounded away 
from O and «., 

For purposes of comparison, note that evaluation of log(l + x) or log((1 + 2)/ 
(1 — x)) by the usual series expansion requires (c + o(1))M({n)n operations, where ¢ 
is a constant of order unity (depending on the range of x and the precise method used). 
Since 13 logen < 2 for n > 83, the O(-M(n) log(7)) method for log(2) should be faster 
than the O(.f(n)n) method for greater than a few hundred. 


ACKNOWLEDGMENTS. This paper was written while the author was visiting the Com- 


19 


20 2. FAST MULTIPLE-PRECISION EVALUATION OF ELEMENTARY FUNCTIONS (1976) 


Fast Multiple-Precision Evaluation of Elementary Functions 251 


puter Science Department, Stanford University. The comments of R.W. Gosper, D.E. 
Knuth, and D. Shanks on various drafts were very useful. 


REFERENCES 
1 Asramowirz, M., anp Srecun, 1A. Handbook of mathematical functions with formulas, 
graphs, and mathematical tables. National Bureau of Standards, Washington, D.C., 1964; 
Dover, 1965, Ch. 17. 
2 Bercer, M, Gosper, R.W , anp ScHrogppet, R Hakmem Memo No. 239, M.I.T. Artificial 
Intelligence Lab., MI T., Cambridge, Mass., 1972, pp. 70-71 
3. BorcHarpt, CW. Gesammelie Werke. Berlin, 1888, pp. 455-462 
4. Brent, R P. The complexity of multiple-precision arithmetic. Proce Semmar on Complexity of 
Computational Problem Solving (held at Australian National U., Dec. 1974), Queensland U. 
Press, Brisbane, Australia 1975, pp 126-165 
5. Brent, R P. Multiple-precision zero-finding methods and the complexity of elementary func- 
tion evaluation Proc Symp. on Analytic Computational Complexity, J.F. Traub, Ed , Aca- 
demic Press, New York, 1976, pp 151-176. 
6, Brent, R.P. Computer Solution of Nonlinear Equateons. Academic Press, New York (to ap- 
pear), Ch. 6. 
7. Brent, RP. A Fortran multiple-precision arithmetic package. Submitted to a technical your- 
nal. 
8. Caruson, B.C Algorithms involving arithmetic and geometric means. Amer. Math. Monthly 
78 (May 1971), 496-505. 
9 Carison, BC. An algornthm for computing logarithms and arciangents. Math. Comput. 26 
(April 1972), 543-549. 
10. Finke., R ,Gorpas,L ,anp Srmony1,C Manuscript in preparation. 
WU. Fiscusr,M J, ano Srockmerar, LJ Fast on-line integer multiplication. J. Comput. System 
Scis, 9 (Dec. 1974), 317-331 
12. Gauss,C.F Carl Friedrich Gauss Werke, Bd. 3 Gottingen, 1876, pp. 362-403. 
13. Gosper, R.W. Acceleration of series. Memo No 304, M.I.T. Artificial Intelligence Lab., 
M1.T , Cambridge, Mass , 1974. 
14. GuiLioup, J.,anp Bovyer,M — 1,000,000 decimals de pi. Unpublished manuscript 
13. Kwurn, D.E. The Art of Computer Programming, Vol. 2 Addison-Wesley, Reading, Mass , 
1969. Errata and addenda: Rep. CS 194, Computer Sci Dep., Stanford U., Stanford, Calif., 1970. 
16. Lacraner, J.L. Oeuvres de Lagrange, Tome 2 Gauthier-Villars, Parts, 1868, pp. 267-272. 
17. Lecznpre, A.M Ezercices de Calcul Integral, Vol. {. Paris, 1811, p. 61. 
18. Sanamin, E. Computation of + using srithmetic-geometric mean. To appear in Math. Comput. 
19 Scwonnace, A., AnD Srrassen, V. Schnelle Multiphkation grosser Zahlen. Computing 7 
(1971), 281-292. 
20. Scuroerpe., R. Unpublished manuscript dated May 1975 
21) Swans, D.,anp Wrencu, I.W. Calculation of x to 100,000 deesmals. Math Comput. 16 (1962), 
76-99, 
22 Sweenry, DW. On the computation of Euler’s constant Math. Comput. 17 (1963), 170-178. 
23 TuHacuer, H.C. Iterated square root, expansions for the inverse cosine and inverse hyperbolic 


cosine. Math. Comput. 16 (1961), 399-403. 


RECEIVED MARCH 1975; REVISED SEPTEMBER 1975 


Journal of the Association for Computing Machinery, Vol. 23, No. 2, April 1976 


3. The arithmetic-geometric mean of Gauss (1984) 


Paper 3: David A. Cox, “The arithmetic-geometric mean of Guass,” L Enseignement 
Mathematique, vol. 30 (1984), p. 275-330. Reprinted by permission. 


Synopsis: 

Although the arithmetic-geometric mean (AGM) is now used widely in high- 
precision computation, it was actually discovered nearly a century earlier, indepen- 
dently by Lagrange, Legendre and, in considerably more detail, by Gauss (although 
there is no evidence that any of them saw the connection to computing 7). As David 
A. Cox points out in this article, Gauss did numerous computations with the AGM, 
such as when he numerically discovered that 


T 
agm(V2, 1) = 1.19814023473355922074... ; i ove =i 

a fact that Gauss declared in his workbook “will surely open an entirely new field 
of analysis.” Indeed it did, as Cox describes in considerable detail. 

As Cox observes, Gauss’ analyses of the AGM for complex arguments are par- 
ticularly interesting. In this case, the square root in the defining formulas of the 
AGM leads to multiple-valued complex functions, and is not at all clear which of 
these branches one should take, nor is it clear whether any of these branches con- 
verge. But Gauss developed a rigorous theory of the complex AGM, and further 
saw connections to the theory of elliptic modular functions. Gauss’ work in this 
area was almost completely unknown to others during his lifetime, but today is 
recognized as a major contribution to the field. 


Keywords: Arithmetic-Geometric Mean, Elliptic Integrals, History 
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THE ARITHMETIC-GEOMETRIC MEAN OF GAUSS 


by David A. Cox 


INTRODUCTION 


The arithmetic-geometric mean of two numbers a and b is defined to be 
the common limit of the two sequences {a,}*> and {b,}% 9 determined 
by the algorithm 


ag = a, bo = b ’ 
(0.1) 
An+t = (a, + b,)/2 ’ Baa = (a,b,)'!? * n= 0, i, 2, wes 


Note that a, and b, are the respective arithmetic and geometric means of a 
and b, a, and b, the corresponding means of a, and h,, etc. Thus the limit 
(0.2) M(a,b) = lim a, = lim 45, 

really does deserve to be called the arithmetic-geometric mean of a and b. 
This algorithm first appeared in a paper of Lagrange, but it was Gauss who 
really discovered the amazing depth of this subject. Unfortunately, Gauss 
published little on the agM (his abbreviation for the arithmetic-geometric 
mean) during his lifetime. It was only with the publication of his collected 
works [12] between 1868 and 1927 that the full extent of his work became 
apparent. Immediately after the last volume appeared, several papers (see [15] 
and [35]) were written to bring this material. to a wider mathematical 
audience. Since then, little has been done, and only the more elementary 
properties of the agM are widely known today. 

In § 1 we review these elementary properties, where a and b are positive 
real numbers and the square root in (0.1) is also positive. The convergence 
of the algorithm is easy to see, though less obvious is the connection 
between the agM and certain elliptic integrals. As an application, we use 


M(,/2, 1) to determine the arc length of the lemniscate. In § 2, we allow a 
and 6 to be complex numbers, and the level of difficulty changes dramatically. 
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The convergence of the algorithm is no longer obvious, and as might be 
expected, the square root in (0.1) causes trouble. In fact, M(a, b) becomes a 
multiple valued function, and in order to determine the relation between the 
various values, we will need to “uniformize” the agM using quotients of the 
classical Jacobian theta functions, which are modular functions for certain 
congruence subgroups of level four in SL(2, Z). The amazing fact is that 
Gauss knew all of this! Hence in §3 we explore some of the history of 
these ideas. The topics encountered will range from Bernoulli's study of elastic 
rods (the origin of the lemniscate) to Gauss’ famous mathematical diary 
and his work on secular perturbations (the only article on the agM published 
in his lifetime). 

I would like to thank my colleagues David Armacost and Robert Breusch 
for providing translations of numerous passages originally in Latin or 
German. Thanks also go to Don O’Shea for suggesting the wonderfully 
quick proof of (2.2) given in § 2. 


1. THE ARITHMETIC-GEOMETRIC MEAN OF REAL NUMBERS 


When a and b are positive real numbers, the properties of the agM M(a, b) 
are well known (see, for example, [5] and [26]). We will still give complete 
proofs of these properties so that the reader can fully appreciate the difficulties 
we encounter in § 2. 

We will assume that a > b > 0, and we let {a,} 9 and {b,} 9 be as 
in (0.1), where b,,, is always the positive square root of a,b,. The usual 
inequality between the arithmetic and geometric means, 


(a+b)/2 > (ab)'"*, 


immediately implies that a, > b, for all n > 0. Actually, much more is true: 
we have 


ij) £2 Sah GPa Sa Phe PRS a Sk SS 
(1.2) 0<a,—b, < 2°>"a—b). 
To prove (1.1), note that a4, > b, and a,,, > 6,4, imply 
iy > (Aq +b,)/2 = Gear > Dns = (yb,)"? > dy, 
and (1.1) follows. From b,,, > b, we obtain 


Qn+1 — Bast ~ An+1 — b, = 2-*(a,—5,) ; 
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and (1.2) follows by induction. From (1.1) we see immediately that lim a, 


avo 


and lim b, exist, and (1.2) implies that the limits are equal. Thus, we can 


ao 


use (0.2) to define the arithmetic-geometric mean M(a, b) of a and b. 
Let us work out two examples. 

Example 1. M(a,a) = a. 
This is obvious because a = b implies a, = b, = a for alln > 0. 


Example 2. M(./2, 1) = 1.1981402347355922074.. 


The accuracy is to 19 decimal places. To compute this, we use the fact 
that a, > M(a, b) > b, for all n > O and the following table (all entries are 
rounded off to 21 decimal places). 


1.414213562373905048802 1.000000000000000000000 
1.207106781 186547524401 1.189207115002721066717 
1.198156948094634295559 1.198123521493120122607 
1.198140234793877209083 1.198140234677307205798 
1.198140234735592207441 1.198 140234735592207439 


hWNwM KF CO 


Such computations are not too difficult these days, though some extra 
programming was required since we went beyond the usual 16 digits of 
double-precision. The surprising fact is that these calculations were done not 
by computer but rather by Gauss himself. The above table is one of four 
examples given in the manuscript “De origine proprietatibusque generalibus 
numerorum mediorum arithmetico-geometricorum” which Gauss wrote in 
1800 (see [12, II], pp. 361-371]). As we shall see later, this is an especially 
important example. 


Let us note two obvious properties of the agM: 


M(a, b) = M(a,,b,) = M(az, 62) = ... 
(1.3) 
M(ha, 4b) = AM(a, b). 


Both of these follow easily from the definition of M(a, b). 


Our next result shows that the agM is not as simple as indicated by 
what we have done so far. We now get our first glimpse of the depth 
of this subject. 


26 3. THE ARITHMETIC-GEOMETRIC MEAN OF GAUSS (1984) 
278 D. A. COX 
THEOREM 1.l. If a>b>O0, then 


n/2 
M(a, | (acos? + b?sin2)~ dp = 1/2. 
0 


Proof. Let I(a, b) denote the above integral, and set p = M(a, b). Thus 
we need to prove I(a, b) = (n/2)u~ 1. The key step is to show that 
(1.4) I(a, b) = I(a,, by). 

The shortest proof of (1.4) is due to Gauss. He introduces a new variable 
o’ such that , 

: 2a sind’ 

( ie a eee eels: 
as any a+b + (a—b)sin*’ 


Note that 0 < $’ < n/2 corresponds to 0 < } < n/2. Gauss then asserts 
“after the development has been made correctly, it will be seen” that 


(1.6) (a2cos + b?sin2)~ /2db = (a? cos?’ +b? sin2’) /2do’ 


(see [12, III, p. 352]). Given this, (1.4) follows easily. In “Fundamenta nova 
theoriae functionum ellipticorum,” Jacobi fills in some of the details Gauss 
left out (see [20, I, p. 152]). Specifically, one first proves that 


2 cosd’(aj cos?’ + bj sin7')"/? 


Bas Se eee 


a + b — (a—b)sin7' 


ee) Di Qi jt /2: 
(a*cos* + b*sin*o) a7 +b + (a—bysin? 


(these are straightforward manipulations), and then (1.6) follows from these 
formulas by taking the differential of (1.5). ; 
Iterating (1.4) gives us 


I(a,b) = (ay, b;) = Taz,.b2) = «« 


so that J(a, b) = lim J(a,,b,) = 1/2 since the functions 


n7~ ao 
(a? cos*h+b? sin?) */? 
converge uniformly to the constant function p7?. QED 


This theorem relates very nicely to the classical theory of complete 
elliptic integrals of the first kind, i.e., integrals of the form 
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nf2 


1 
F(k, 1/2) => | (1 —k?sin*)~*/4#do = | (1-27) (1 —k?z*))- YW2q7. 


i 


. a—b : 
To see this, we set k = aah? Then one easily obtains 


I(a,b) = a“? AM, n/a), H(a,, by) = ay? F(k, n/2), 


so that (1.4) is equivalent to the well-known formula 
k 
F (Ww ; ria) = (1+k) F(k, 2/2) 


(see [16, p. 250] or [17, p. 908]). Also, the substitution (1.5) can be written as 


(1+ k)sing’ 


me Tk sinh 


which is now called the Gauss transformation (see [32, p. 206}). 

For someone well versed in these formulas, the derivation of (1.4) would 
not be difficult. In fact, a problem on the 1895 Mathematical Tripos was 
to prove (1.4), and the same problem appears as an exercise in Whittaker 
and Watson’s Modern Analysis (see [36, p. 533]), though the agM is not 
mentioned. Some books on complex analysis do define M(a, b) and state 
Theorem 1.1 (see, for example, [7, p. 417]). a 

There are several other ways to express Theorem 1.1. For example, if 
0 < k < I, then one can restate the theorem as 


(1.7) ee eee me 12 sin? )~ 2g RR 2 
M(L+k,1-k) nJo yond = — Fk, x2). 


Furthermore, using the well-known power series expansion for F(k, 1/2) 
(see [16, p. 905]), we obtain 


1 2 [1-3-..+(2n—4) |? 
Le Oi ce A i ot 2n 
oi M(i+k, 1-& Dal Pal | ai 


Finally, it is customary to set k’ = ./1 — k?. Then, using (1.3), we can 
rewrite (1.7) as ; 


1 2 
(1.9) = | (1—k? sin?y)~ V2dy , 
mt 


Q 
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This last equation shows that the average value of the function 
(1—k? sin?y)~‘/? on the interval [0, 2/2] is the reciprocal of the agM of the 
reciprocals of the minimum and maximum values of the function, a lovely 
interpretation due to Gauss — see [12, III, p. 371]. 

One application of Theorem 1.1, in the guise of (1.7), is that the algorithm 
for the agM now provides a very efficient method for approximating the 
elliptic integral F(k, 1/2). As we will see in §3, it was just this problem 
that led Lagrange to independently discover the algorithm for the agM. 

Another application of Theorem: 1.1 concerns the arc length of the 
lemniscate r? = cos 20: 


Using the formula for arc length in polar coordinates, we see that the total 
arc length is 
n/4 


n/4 
af (r? + (dr/d0)?)*/? do = af (cos 20) */? d0. 


0 0 


The substitution cos 20 = cos* transforms this to the integral 


n/ 


n/2 2 
a| (1+cos7o)"/? db = | (2 cos? +sin2)~ 1/2 do. 


i?) 0 


Using Theorem 1.1 to interpret this last integral in terms of M(,/2, 1), 
we see that the arc length of the lemniscate r? = cos 260 is 2n/M(./2, 1). 

From Example 2 it follows that the arc length is approximately 5.244, 
and much better approximations can be easily obtained. (For more on the 
computation of the arc length of the lemniscate, the reader should consult [33].) 

On the surface, this arc length computation seems rather harmless. 
However, from an historical point of view, it is of fundamental importance. 
If we set z = cosd, then we obtain 


® 1 
1?) 


c¢) 
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The integral on the right appeared in 1691 in a paper of Jacob Bernoulli 
and was well known throughout the 18th century. Gauss even had a special 
notation for this integral, writing 


1 
O= 2| (l—24)7 2 dz. 
0 


Then the relation between the arc length of the lemniscate and M(/2, 1) 
can be written 
Tt 
M(,/2, 1) = - 


To see the significance of this equation, we turn to Gauss’ mathematical 
diary. The 98th entry, dated May 30, 1799, reads as follows: 


We have established that the arithmetic-geometric mean between | and 


ve is x/® to the eleventh decimal place; the demonstration of this 
fact will surely open an entirely new field of analysis. 


(See [12, X.1, p. 542].) The genesis of this entire subject lies in Gauss’ 
observation that these two numbers are the same. It was in trying to 
understand the real meaning of this equality that several streams of Gauss’ 
thought came together and produced the exceptionally rich mathematics 
which we will explore in § 2. 

Let us first examine how Gauss actually showed that M(/2, 1) = 7/0. 
The proof of Theorem 1.1 given above appeared in 1818 in a paper on 
secular perturbations (see [12, III, pp. 331-355]), which is the only article 
on the agM Gauss published in his lifetime (though as we’ve seen, Jacobi 
knew this paper well). It is more difficult to tell precisely when he first 
proved Theorem 1.1, although his notes do reveal that he had two proofs 
by December 23, 1799, 

Both proofs derive the power series version (1.8) of Theorem 1.1. Thus 
the goal is to show that M(1+k, 1—k)~! equals the function 


(1.10) pe (Ae en 


The first proof, very much in the spirit of Euler, proceeds as follows. 
Using (1.3), Gauss derives the identity 


2t 2t 1 
1.1 es, fa eS 2 1-12), 
(1.11) (i+, 1 a] fe 1-t*) 
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He then assumes that there is a power series expansion of the form 


1 


So 2 4 6 
MA+k 1—h 1+Ak*+ B+ Ck 4... . 


By letting k = ¢? and 2¢/(1+1¢7) in this series and using (1.11), Gauss obtains 


2¢ \? / de \4 2% \? 
1+A + B{ —— 2 — 
(Ss) (=) + fea * 


= (1+t?) (1+ At*+ Bt8+ Ct}? +...). 


Multiplying by 2t/(1+t7), this becomes 


a +A uf fs si ee = 21(1+At*+ Br® +...) 
1+? 1+t? 1+¢? os ; bia 


A comparison of the coefficients of powers of t gives an infinite system of 
equations in A, B, C,.... Gauss showed that this system is equivalent to the 
equations 0 = 1 — 4A = 9A — 16B = 25B — 36C = ..., and (1.8) follows 
easily (see [12, III, pp. 367-369] for details). Gauss’ second proof also 
uses the identity (1.11), but in a different way. Here, he first shows that 
the series y of (1.10) is a solution of the hypergeometric differential equation 


(1.12) (k2—ky” + (3k?—LDy’ + ky = 0. 


This enables him to show that y satisfies the identity 


2t 
y (Ss) = (1+17)y(t?), 


so that by (1.11), F(k) = M(1+k, 1—k)y(k) has the property that 


F = = F(t? 
iy (). 


Gauss then asserts that F(k) is clearly constant. Since F(0) = 1, we obtain a 
second proof of (1.8) (see [12, X.1, pp. 181-183]). It is interesting to note 
that neither proof is rigorous from the modern point of view: the first 
assumes without proof that M(1+k,1—k)~* has a power series expansion, 
and the second assumes without proof that M(1+k, 1—k) is continuous (this 
is needed in order to show that F(k) is constant). 


We can be certain that Gauss knew both of these proofs by December 23, 
1799. The evidence for this is the 102nd entry in Gauss’ mathematical 
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diary. Dated as above, it states that “the arithmetic-geometric mean is itself 
an integral quantity” (see [12, X.1, p. 544)). However, this statement is not 
so easy to interpret. If we turn to Gauss’ unpublished manuscript of 1800 
{where we got the example M(./2, 1)), we find (1.7) and (1.8) as expected, 
but also the observation that a complete solution of the differential equation 
(1.12) is given by 
A B 

(1.13) Matk 1-6 * MU.B’ A,BeC 
(see [12, II], p. 370]). In eighteenth century terminology, this is the “complete 
integral” of (1.12) and thus may be the “integral quantity” that Gauss was 
referring to (see (12, X.1, pp. 544-545]). Even if this is so, the second proof 
must predate December 23, 1799 since it uses the same differential equation. 

In §3 we will study Gauss’ early work on the agM in more detail. 
But one thing should be already clear: none of the three proofs of Theorem 1.1 
discussed so far live up to Gauss’ May 30, 1799 prediction of “an entirely 
new field of analysis.” In order to see that his claim was justified, we will 
need to study his work on the agM of complex numbers. 


2. THE ARITHMETIC-GEOMETRIC MEAN OF COMPLEX NUMBERS 


The arithmetic-geometric mean of two complex numbers a and 6 is not 
easy to define. The immediate problem is that in our algorithm 


ag =@ > bo = b, 
(2.1) 
G+, = (a, +,)/2 , Patt aa (ayb,)'/? a 0, 1, 25 oo) 


there is no longer an obvious choice for b,,,. In fact, since we are 
presented with two choices for b,,, for all n > 0, there are uncountably 
many sequences {a,}7, and {b,}%, for given a and b. Nor is it clear 
that any of these converge! 

We will see below (Proposition 2.1) that in fact all of these sequences 
converge, but only countably many have a non-zero limit. The limits of 
these particular sequences then allow us to define M(a,b) as a multiple 
valued function of a and b. Our main result (Theorem 2.2) gives the relation- 
ship between the various values of M(a, 6). This theorem was discovered 
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by Gauss in 1800, and we will follow his proof, which makes extensive use 
of theta functions and modular functions of level four. 

We first restrict ourselves to consider only those a’s and b’s such that 
a#0,b#0anda # +b. (If a=0, b=0 or a= +b, one easily sees that the 
sequences (2.1) converge to either 0 or a, and hence are not very interesting.) 
An easy induction argument shows that if a and b satisfy these restrictions, 
so do a, and b, for all n > 0 in (2.1). 

We next give a way of distinguishing between the two possible choices 
for each b,4,. 


Definition. Let a, be C* satisfy a # +b. Then a square root b, of ab is 
called the right choice if |a, — b,| <|a, + b,| and, when |a, — 6,| 
=|a, + by |, we also have Im(b,/a,) > 0. 


To see that this definition makes sense, suppose that Im(b,/a,) = 0. 
Then b,/a, = re R, and thus 


la,—b,|=la][1—r] Ala] ]1+r|] =la, + d,| 


since r # 0. Notice also that the right choice is unchanged if we switch a 
and b, and that if a and b are as in § 1, then the right choice for (ab)*/? is the 
positive one. 

It thus seems natural that we should define the agM using (2.1) with 
b,+; always the right choice for (a,b,)'/?. However, this is not the only 
possibility: one can make some wrong choices for b,,, and still get an 
interesting answer. For instance, in Gauss’ notebooks, we find the following 
example: 


0 3.0000000 1.0000000 
1 2.0000000 — 1.7320508 
2 1339746 1.8612098i 
3 0669873 + .9306049i 3530969 + .3530969i 
- .2100421 + .6418509i .2836903 + .6208239i 
5 2468676 + .63133741 2470649 + .6324002i 
6 2469962 + .6318688i 2469962 + .6318685i 


(see [12, III, p. 379]). Note that b, is the wrong choice but 5, is the right 
choice for n > 2. The algorithm appears to converge nicely. 
Let us make this idea more precise with a definition. 
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Definition. Let a,beC* satisfy a # +b. A pair of sequences {a,} 5 
and {b,}% as in (2.1) is called good if b,,, is the right choice for 
(a,b,)'/? for all but finitely many n > 0. 

The following proposition shows the special role played by good sequences. 


PROPOSITION 2.1. If a,beEC* satisfy a# tb, then any pair of 
sequences {a,}=.9 and {b,}“.5 as in (2.1) converge to a common limit, 
and this common limit is non-zero if and only if {a,}, and {b,}2.9 are 
good sequences. 


Proof. We first study the properties of the right choice b, of (ab)'/? 
in more detail. Let 0 < ang(a, b) < m denote the unoriented angle between a 
and b, 


Then we have: 
(2.2) la, —b,| < (1/2)|a - | 
(2.3) ang(a,,b;) < (1/2) ang(a, b). 
To prove (2.2), note that 
la, — b, | ee b, | = (1/4)]a — b|?. 


Since | a; — b,| < a, + d, |, (2.2) follows immediately. To prove (2.3), let 
6, = ang(a,,b,) and 8 = ang(a, b). From the law of cosines 


fa, + by |? = a, |? + 1b, |? +2] a, | |b, | cos, , 
we see that 8, < 7/2 because | a, — b,| < | a, + b,|. Thus 
ang(a,,;) = 8, < m~—- 0, = ang(a,, —b,). 


To compare this to 0, note that one of +b,, say b4, satisfies ang(a, b4) 
= ang(b,,b) = 0/2. Then the following picture 


b4 


a, 


o ¢ 
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shows that ang(a,, b’,) < 8/2. Since b’, = +b,, the above inequalities imply 
that 
ang(a; , b,) < ang(a,,b4) < (1/2) ang(a, b), 


proving (2.3). 
Now, suppose that {a,}*. and {b,}., are not good sequences. We set 
M,, = max{|a,|,| 5, |}, and it suffices to show that lim M, = 0. Note that 


M,,.; <M, for n > 0. Suppose that for some n, b,,, is not the right choice 
for (a,b,)'/*. Then —b,., is the right choice, and thus (2.2), applied to a, 


and b,,, implies that 
© Gea] = (1/2) nar — basa | < (1/4) | dy — by 1 < (1/2)M,. 
However, we also have | b,,2| < M,,. It follows easily that 
(2.4) M,+3 < (3/4)M,. 
Since {a,}*.) and {b,}*) are not good sequences, (2.4) must occur infinitely 


often, proving that lim M, = 0. 


Next, suppose that {a,}*) and {b,}*.5 are good sequences. By neglecting 
the first N terms for N sufficiently large, we may assume that b,,, is 
the right choice for all n > 0 and that ang(a, b) < x (this is possible by (2.3)). 
We also set 6, = ang(a,, b,). From (2.2) and (2.3) we obtain 


(2.5) la —h&| <2 "|a—b|], 8,<—2°°6,. 
Note that a, — a,,, = (1/2) (a,—b,), so that by (2.5), 
[%. — G a3) Se a — 8] 


Hence, if m >'n, we see that 
m-1 m-1 
\a,— Gals ¥ le — ae S| YOY | le blL <2"1a— 5. 
I k=n 


Thus {a,}°.» converges because it is a Cauchy sequence, and then (2.5) 
implies that lim a, = lim b,. 
n7?>o noo 
It remains to show that this common limit is nonzero. Let 
m, = min{|a,|,| 5, |}. 


Clearly | b,,, | = m,. To relate |a,,,| and m,, we use the law of cosines: 
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(2lay4,1)? = 1a,}7 + 1b, 1? + 212, |b, | cosd, 


> 2m?2(1+cos8,) = 4m? cos(6,/2) . 


It follows that m,,, 2 cos(8,/2)m, since 0 < 0, < x (this uses (2.5) and the 
fact that 8) = ang(a, b) < x). Using (2.5) again, we obtain 


m, = (i cot2} Mo 
k=1 


However, it is well known that 
sin8, 


ia cos(6,/2") = 


tt) 


(See [16, p. 38]. When 0, = 0, the right hand side is interpreted to be 1.) We 


thus have 
sing 
2G) 


for alln > 1. Since 0 < 8) < 1, it follows that lim a, = lim b, #0. QED 


n+o n7e@ 


We now define the agM of two complex numbers. 


Definition. Let a,beC* satisfy a # +b. A nonzero complex number pu 
is a value of the ne -geometric mean M{(a,b) of a and b if there 
are good sequences {a,} %» and {b,} 5 as in (2.1) such that 

ph = lim a, = lim 8,. 
Thus M(a, b) is a multiple valued function of a and b and there are a 
countable number of values. Note, however, that there is a distinguished 
value of M(a, 6), namely the common limit of {a,}, and {b,}*) where 
5,4, is the right choice for (a,b,)'’? for all n > 0. We will call this the 
simplest value of M(a,b). When a and 6b are positive real numbers, this 
simplest value is just the agM as defined in § 1. 


We now come to the major result of this paper, which determines how 
the various values of M(a, b) are related for fixed a and b. 


THEOREM 2.2. Fix a,beC* which satisfy a# ib and \a|2|b|, 
and let jt and % denote the simplest values of M(a,b) and M(a+b,a—b) 
respectively, Then all values yt’ of M(a,b) are given by the formula 
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where d and c_ are arbitrary relatively prime integers satisfying 
d=1mod4 and c=Omod 4. 


Proof. Our treatment of the agM of complex numbers thus far has 
been fairly elementary. The proof of this theorem, however, will be quite 
different; we will finally discover the “entirely new field of analysis” predicted 
by Gauss in the diary entry quoted in §1. In the proof we will follow 
Gauss’ ideas and even some of his notations, though sometimes translating 
them to a modern setting and of course filling in the details he omitted 
(Gauss’ notes are extremely sketchy and incomplete — see [12, III, pp. 467- 
468 and 477-478}). 

The proof will be broken up into four steps. In order to avoid writing a 
treatise on modular functions,.we will quote certain classical facts without 
proof. 


Step 1. Theta Functions 
Let § = {t¢C:Imt > 0} and set q = e™*. The Jacobi theta functions 
are defined as follows: 


pt) = 1+2 Yq” = O,7,0), 
qe) = 1 +2 ¥ (—1'q" = O40), 


re) = 2 Dg" = O,(1, 0). 
n=1 


Since |q| < 1 for te §, these are holomorphic functions of t. The notation 
p, q and r is due to Gauss, though he wrote them as power series in. 
e “, Ret > O (thus he used the right half plane rather than the upper half 
plane § — see [12, III, pp. 383-386]). The more common notation 03, ©, 
and @, is from [36, p. 464] and [32, p. 27]. 

A wealth of formulas are associated with these functions, including the 
product expansions: 


p(t) = I (1—q*")(1+q""")?, 


(2.6) qt) = I aq egy 5 
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r(t) = 2q'/* [] (1-q*")(1+q7")’, 
n=} 


(which show that p(t), g(t) and r(t) are nonvanishing on S$), the trans- 
formations: 


pt+1) = g(r), p(—1/t) = (—it)' pit), 
(2.7) qit+1) = p(t), qa—1/t) = (-it)"7r(0), 
r(tt+l) = e™*x(1), (—1/t) = (—it)"? q(t), 


(where we assume that Re(—it)'/? > 0), and finally the identities 


p(t)? + g(t)? = 2p(2t)*, 
(2.8) p(t)? — a(x)? = 2r(2t)’, 
At)q(t) = g(2r)’, 


and 


p21)? + r(2t)? = pr)’, 
(2.9) p20)? — r(21)? = g(t), 
q(t)* + r(t)* = ptr)’. 


Proofs of (2.6) and (2.7) can be found in [36, p. 469 and p. 475], while 
one must turn to more complete works like [32, pp. 118-119] for proofs of 
(2.8). (For a modern proof of (2.8), consult [34].) Finally, (2.9) follows easily 
from (2.8). Of course, Gauss knew all of these formulas {see [12, IIL 
pp. 386 and 466-467}). 

What do these formulas have to do with the agM? The key lies in (2.8): 
one sees that p(2t)? and q(2t)? are the respective arithmetic and geometric 
means of p(t)? and q(t)?! To make the best use of this observation, we 
need to introduce the function k/(t) = q(t)*/p(t)?. 

Then we have: 


LeMMA 2.3. Let a,beC* satisfy a # +b, and suppose there is tE 9 
such that k(x) = b/a. Set yp = a/p{t)? and, for n>O0, a, = pw p(2"r)? 
and b, = pq(2"1)*. Then 
(i) {a,}"_ and {b,}".. are good sequences satisfying (2.1), 

(ii) lim a, = lim b, = p. 

n> oG n> oO 

Proof. We have dg = a by definition, and by = b follows easily from 
k(t) = b/a. As we observed above, the other conditions of (2.1) are clearly 
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satisfied. Finally, note that exp(i2"t) > 0 as n— co, so that lim p(2"1)? 


n->o 


= lim q(2"t)? = 1, and (ii) follows. Since 1 4 0, Proposition 2.1 shows that 


{a,} 29 and {b,}* ) are good sequences. QED 

Thus every solution t of k(t) = b/a gives us a value p = a/p(t)? of 
M(a, b). As a first step toward understanding all solutions of k(t) = b/a, we 
introduce the region F, € §: 


F, = {te §:|Ret| <1, | Re(i/t)| < 1} 


F, shaded 


=] O | 
The following result is well known. 
LEMMA 2.4. k’? assumes every value in C — {0,1} exactly once in 
Fi, = F, — (OF ,n{te§: Ret <0}). 
A proof can be found in [36, pp. 481-484]. Gauss was aware of similar 


results which we will discuss below. He drew F, as follows (see [12, III, 
p. 478)). 


1 
Raum fiir t und — 
t 


Note that our restrictions on a and b ensure that (b/a)*=eC — {0, 1}. 
Thus, by Lemma 2.4, we can always solve k(t)? = (b/a)’, ie., k(t) = +b/a. 
We will prove below that 


, T ett 
(2.10) k (<3) = —k(), 
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which shows that we can always solve k(t) = b/a. Thus, for every a and b 
as above, M(a, b) has at least one value of the form a/p(t}”, where k(t) = b/a. 

Three tasks now remain. We need to find all solutions t of k’(t) = b/a, 
we need to see how the values a/p(t)’ are related for these t’s, and we need 
to prove that all values of M(a,b) arise in this way. To accomplish these 
goals, we must first recast the properties of k‘(t) and p(t)? into more modern 
terms. 


Step 2. Modular Forms of Weight One. 

The four lemmas proved here are well known to experts, but we include 
their proofs in order to show how easily one can move from the classical 
facts of Step 1 to their modern interpretations. We will also discuss what 
Gauss had to say about these facts. 

We will use the transformation properties (2.7) by way of the group 


b 
SQ, Z) = 1(¢ AF a,b,c, deZ, ad—be = i} 


which acts on § by linear fractional transformations as follows: if 
at+b 


y= « f)€ SUs2,Z) and x  § then = eer i 


For example, if 


0 -1 1 1 —1 
§ = = = = 
( i ) and T ({ i} then St oat Tr=t+i1, 


which are the transformations in (2.7). It can be shown that S§ and T 
generate SL(2, Z) (see [29, Ch. VIL, Thm. 2]), a fact we do not need here. 

We will consider several subgroups of SL(2, Z). The first of these is I'(2), 
the principal congruence subgroup of level 2: 


1 
TQ) = {ye SLQ,Z):7 = (( i od 2}. 
Note that —i ¢ F'(2) and that I'(2)/{+1} acts on §. 
LEMMA 2.5. 
(i) TQV{+1} acts freely on §. 
(ii) (2) is generated by —1,U = io :) Py ae ae e AF 


2 1 
(ili) Given te §, thereis yeT(2) such that yteF e 
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b 
Proof. Let y = C i) be an element of I(2). 


(i) If t€ § and yt = 1, then we obtain ct” + (d—a)t —-b = 0. If c = 0, 
then y = +1 follows immediately. If c # 0, then (d—a)? + 4 bc < 0 because 
te. Using ad — bc = 1, this becomes (a+d)? < 4, and thus a+ d=0 
since a and d are odd. However, b and c are even so that 


1 = ad — be = ad = — a* mod4 


This contradiction proves (i). 


(ii) We start with a variation of the Euclidean algorithm. Given y as 
above, let r; = a — 2a,c, where a, € Z is chosen so that |r, | is minimal. 


Then |r, | <|c|, and hence |r, | <|c| since a and c have different parity. 
Thus 


@=2aje+7r, Qs5rmEB; |rml<lels 


Note that c and r, also have different parity. Continuing this process, we 
obtain 


¢=2a,rnt+r,, Irl<Irnil, 
ry = 2a3r2 +173, I|Ir31<Iral, 
Tan=1 = 242n+1 Tan + Tanti. one = af bis 


lon = 2Gon+2Ton+1 + 9, 


since GCD(a, c) = 1. Then one easily computes that 
~A2n+2 ~G2n+1 at ay sae . 
V U ee a i 6 4)" 


Since the left-hand side is in I'(2), the right-hand side must be of the form 
+U”™, and we thus obtain 


y = + Ut V%,., Usenet Yarns Ym , 


(iii) Fix t€ §. The quadratic form |xt + y|? is positive definite for 
x, y€R, so that for any S < Z?, | xt + y|? assumes a minimum value at 


b 
some (x, y) € S. In particular, | ct + d|?, where y = ‘e Je T(2), assumes a 


d 


minimum value at some yp) €I(2). Since Im yt = Im t| ct + d|~?, we see 
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that t' = yot has maximal imaginary part, ie., Im vt’ > Im yt’ for y € (2). 
Since Im 1’ = Im Uv’, we may assume that | Re t’] < 1. Applying the above 
inequality to V*? €T(2), we obtain 


Imv > Im V*?t) = Im? [20 +1177. 


Thus | 2t +1| > 1, or | t + (1/2)| > 1/2. This is equivalent to | Re 1/*’] < 1 
and hence t'e F,. QED 


We next study how p(t) and q(t) transform under elements of P(2). 


Lemma 2.6, Let y = & ’) éT(2), and assume that a=d= 1 mod 4, 
c 


Then 
(i) plyt)? = (ct +d) pt)’, 
(ii) glyt)? = tet +d) g(t)’. 


0 -i1 01 . 
Proof. From (2.7) and ¥ = ( ) u- ( ) we obtain 


1 0 —! Q 
Uy = pty?, pV)? = (2t+1) po’, 
(2.11) aUr)? = gt)?, (Vt)? = —(Q2t+1) (0). 


Thus (i) and (11) hold for U and V. The proof of the previous lemma shows 
that y is in the subgroup of [°(2) generated by U and V. We now proceed 
by induction on the length of y as a word in U and V. 


, b 
fi) If y = (? ‘) and p(yt)* = (ct+d) p(t)? then (2.11) implies that 


p(Uyt)? = p(y)? = (ct+4) pv’, 
pViyt)? = (2yt+ 1) (yt)? = (2yt+ 1) (et +d) p(t? 
= ((2a+c)t+(2b+d)) p(t)? . 
However Uy = a , , Vy = ( a an Pe so that (i) now holds for 


Uy and Vy, 
(ii) Using (2.11) and arguing as above, we see that if 7 = 6 i) 
- c 
= Ut yh Ut Yen then 


aye)? = (—1*(et +d) g(x). 
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However, U and V commute modulo 4, so that 


1 2ka; 
= ‘ d4. 
' 3 1 ) 5 


Thus c = 2b; mod 4, and (ii) follows. QED 


Note that (2.10) is an immediate consequence of Lemma 2.6. 
In order to fully exploit this lemma, we introduce the following subgroups 
of I'(2): 
I(2)o = {ye T(2):a = d = 1 mod 4}, 


T,(4) = {ye T(Q)o:c = 0 mod 4} 


Note that I'(2) = {+1}-I(2)) and that I’,(4) has index 2 in I'(2)). From 
Lemma 2.6 we obtain , 


pvt)? = (ct+d) p(t)’, ye T(2)o, 
(2.12) 

gy = (ctt+d) q(t’, yer,f4). 
Since these functions are holomorphic on §, one says that p(t)? and q(t)? 
are weak modular forms of weight one for I(2)) and I’,(4) respectively. 
The term more commonly used is modular form, which requires that the 
functions be holomorphic at the cusps (see [30, pp. 28-29] for a precise 
definition). Because I(2)) and I,(4) are congruence subgroups of level 
N = 4, this condition reduces to proving that 


(2.13) (ct+d)~* p(yt)?, (et +d)? a(yt)’, 


are holomorphic functions of q!/? = exp(2mit/4) for all y e SL(2, Z). This will 
be shown later. 

In general, it is well known that the square of a theta function is a 
modular form of weight one (see [27, Ch. I, § 9]), although the general theory 
only says that our functions are modular forms for the group 


T(4) = {ye SL2,Z):y = : 7 mod 4} 


(see [27, Ch. I, Prop. 9.2]). We will need the more precise information 
given by (2.12). 

We next study the quotients of § by I(2) and I,(4). From Step 1, 
recall the region F, © §. We now define a larger region F: 


F = {te§:|Ret| < 1, |t +1/4| > 1/4, |t +3/4| > 1/4}. 
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F shaded 


F, above dashed lines 


=| =i/2 2) 1/2 | 


We also set 
Fi, = F, — (6F, 9 {ve §: Ret < 0}) 
F = F — (éF mn {te€§: Ret < 0}). 


LEMMA 2.7. F’, and F' are fundamental domains for T(2) and Y4) 
respectively, and the functions k’? and k’ induce biholomorphic maps 


k?; $/T(2) > C - {0,3 
K : $74) > C — (0, +1}. 
Proof. A simple modification of the proof of Lemma 2.6 shows that if 
= ' 4 € F(2), then p(yt)* = (et +4)? p(t)*, q(yt)* = (ct +a)? g(0)*. Thus 
¢ 


k’ is invariant under I(2). 
Given te §, Lemma 2.5 shows that yreF, for some yeI(2). Since 


2 
U = € , maps the left vertical line in OF, to the mght one and 


Y= (; 0 maps the left semicircle in OF, to the right one, we may 


assume that yte F‘,. If we also had ote F’, for o¢I(2), then k{or)* 
= k(t)? = k(yt)’, so that ot = yt by Lemma 2.4. This shows that F{, is a 
fundamental domain for I'(2). 

Since T(2)o ~ F'(2)/{41}, F is also a fundamental domain for I(2),. 


Since [,(4) has index 2 in I(2)y with 1 and V as coset representatives, 
it follows that 


F* = Fi U VFL (te: Ret < O})U V (FY 9 {te §: Ret > 0}) 
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F* shaded 


aa = PE O \/2 | 


2 a €T,(4) takes the far 
left semicircle in OF to the far right one, it follows that F’ is a fundamental 
domain for I,(4). 

It now follows easily from Lemma 2.4 that k’? induces a bijection 
k? : $/TQ) + C — {0, 1}. Since I'(2)/{+1} acts freely on § by Lemma 2.5, 


§/T(2) is a complex manifold and k? is holomorphic. A straightforward 


—3 -2 
is a fundamental domain for I’,(4). Since ( 


argument then shows that k? is biholomorphic. 
Next note that k’ is invariant under I’,(4) by (2.12), and thus induces a 


map k’: $/T (4) > C — {0, +1}. Since §/I(2) = §/T(2)), we obtain a com- 
mutative diagram: 


9/74) & C— (0,13 
fl Lg 
G/T(2)o > C — {0,1} 


where f is induced by I’,(4) € I'(2)) and g is just g(z) = z?. Note that g is a 
covering space of degree 2, and the same holds for f since [I(2)): I',(4)] = 2 
and I(2)) acts freely on §. We know that k? is a biholomorphism, and 
it now follows easily that K is also. QED 


We should point out that r(t)* has properties similar to p(t)? and q(t)’. 
Specifically, r(t)? is a modular form of weight one for the group 


r,(4! = {yeT(2):y= (: i) mod 4} , 


which is a conjugate of I,(4). Furthermore, if we set k(t) = r(t)*/p(t)?, 
then k is invariant undér I’,(4)' and induces a biholomorphism k: §/T ,(4)' 
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+ C — {0, +1}. We leave the proofs to the reader. Note also that k(t)? 
+ k(t? = 1 by (29). 

Our final Iemma will be useful in studying the agM. Let F, be the 
region (1/2)F,, pictured below. Note that F; ¢ F. 


= Ife... O 1/2 | 


F, shaded 
F, F, indicated by dashed lines 


LEMMA 2.8. 
k(F,) = {ze C — {0,41}: Rez > 0}, 
K(F,) = {zeC — {0, +1}:|z| < 1}. 
Proof. We will only treat k'(F,), the proof for k(F,) being quite similar. 
We first claim that {k(t): Ret = +1/2} = S* — {+1}. To see this, note 
that Ret = +1/2 and the product expansions (2.6) easily imply that k@) 
= k'(t)7}, ie, | k'(t)| = 1. How much of the circle is covered? It is easy 


to see that k(+1/2+it) + 1 as t+ + a. To study the limit as t +0, 
note that by (2.10) we have 


k(41/2+it) = —k’ (4 v2+7). 


As t- 0, the right-hand side clearly approaches ~1. Then connectivity 
arguments easily show that all of S' — {+1} is covered. 
Since k’ is injective on F’ by Lemma 2.7, it follows that k’(F,) — S! 
is connected. Since | k’(it)| < 1 for t > O by (2.6), we conclude that 
K(F,) S {ze — {0, th} :|z| < 1}. 
Similar arguments show that 


kK(F—F,) S {zeC:|z| > i}. 
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Since k'(F) = C — {0, +1} by Lemma 2.7, both inclusions must be equalities. 
QED 


Gauss’ collected works show that he was familiar with most of this 
material, though it’s hard to tell precisely what he knew. For example, he 
basically has two things to say about k’(t): 


(i) k(t) has positive real part for te F,, 


(ii) the equation k(t) = A has one and only one solution te F,. 


(See [12, III, pp. 477-478].) Neither statement is correct as written. Modifica- 
tions have to be made regarding boundary behavior, and Lemma 2.8 shows 
that we must require | A | < 1 in (ii). Nevertheless, these statements show that 
Gauss essentially knew Lemma 2.8, and it becomes clear that he would not 
have been greatly surprised by Lemmas 2.4 and 2.7. 

Let us see what Gauss had to say about other matters we’ve discussed. He 
was quite aware of linear fractional transformations. Since he used the right 
half plane, he wrote . 

Pate wiske=l, qkete%, Biro 
cit+d 
(see [12, III, p. 386]). To prevent confusion, we will always translate formulas 
into ones involving t € §. 

Gauss decomposed an element y € SL(2, Z) into simpler ones by means of 
continued fractions. For example, Gauss considers those transformations 
t* = yt which can be written as 


(2.14) o =—— +} 2a, 


— 1 
c* = ef) = —__ + Ja, 
t 


1 2 L 
(see [12, X.1, p. 223]). If U = i ) and V = a th thea." = UV 74x, 


so that for n even we see a similarity to the proof of Lemma 2.5 (ii). 
The similarity becomes deeper once we realize that the algorithm used in 


a b 
the proof gives a continued fraction expansion for a/c, where y = (? * 
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However, since n can be odd in (2.14), we are dealing with more than just 
elements of I°(2). 

Gauss’ real concern becomes apparent when we see him using (2.14) 
together with the transformation properties of p(t). From (2.7} he obtains 


p(t*) = ./(—it) (—iv)}+ ++ (—it"” ») p(r) 
(see [12, X.1, p. 223}). The crucial thing to note is that if t* = yt, 
b 
a (? i) then (—it)+++(—it”"”) is just ct +d up to a power of i. 
c 


This tells us how p(t) transforms under those y’s described by (2.14). In 
general, Gauss used similar methods to determine how p(t), g(t) and r(t) 
transform under arbitrary elements y of SL(2, Z), The answer depends in part 


b 
on how y = ¢ i) reduces modulo 2. Gauss labeled the possible reductions 


as follows: 


6 


(see [12, X.1, p. 224]). We recognize this as the isomorphism SL(2, Z)/T(2) 
~ S12, F,), and note that (2.14) corresponds to cases 1 and 6. Then the 


transformations of p(t), q(t) and r(t) under y = (? ‘) € SL(2, Z) are given by 


1 2 3 4 5 6 
h~* wy) = | pO | a | rm) | at) [ Mr) | ply) 
(2.15) A ayt) = | a | vO) | POO | ret) | per) | rt) 
ho ryt) = | rt) | re) | a) | ro | a) tat) 


where h = (i(ct+d))'? and > is an integer depending on both y and which 
one of p(t). q(t} -or r(t) is being transformed (see [12, X.1, p. 224]). Note that 
Lemma 2.6 can be regarded as giving a careful analysis of X in case 1. 
An analysis of the other cases may be found in [13, pp. 117-123]. One 
consequence of this table is that the functions (2.13) are holomorphic functions 
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of q’/?, which proves that p(t)’, q(t)? and r(t)? are modular forms, as 
claimed earlier. 

Gauss did not make explicit use of congruence subgroups, although they 
appear implicitly in several places. For example, the table (2.15) shows Gauss 
using I'(2). As for T'(2)), we find Gauss writing 


k'(yt) = itk'(t) 


a -—b 
where y = te d 
a=d=1mod4, b, c even” (see [12, III, p. 478]). Also, if we ask which 
of these y’s leave k’ unchanged, then the above equation immediately gives 
us I°,(4), though we should be careful not to read too much into what 
Gauss wrote. 

More interesting is Gauss’ use of the reduction theory of positive definite 
quadratic forms as developed in Disquisitiones Arithmeticae (see [11, § 171)). 
This can be used to determine fundamental domains as follows. A positive 
definite quadratic form ax? + 2bxy + cy* may be written a| x — ty |? where 
te. An easy computation shows that this form is equivalent via an 
element y of SL(2, Z) to another form a’ | x — t’y |? if and only if t’ = y~'t. 
Then, given te §, Gauss applies the reduction theory mentioned above to 
|x — ty|? and obtains a SL(2, Z)— equivalent form A|x — ty|? = Ax? 
+ 2Bxy + Cy? which is reduced, ic. 


) and, as he carefully stipulates, “ad — bc = 1, 


2(|Bi/ <Ae€ 


(see [11, §171] and [12, X.1, p. 225]). These inequalities easily imply that 
| Ret’ | < 1/2, | Re 1/t’ | < 1/2, so that t’ lies in the shaded region 


+f = Ire fe) 172 | 
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which is well known to be the fundamental domain of SL(2, Z) acting on 5 
(see [29, Ch. VII, Thm. 1)). 

This seems quite compelling, but Gauss never gave a direct connection 
between reduction theory and fundamental domains. Instead, he used reduc- 
tion as follows: given te, the reduction algorithm gives t = yt as 
above and at the same time decomposes y into a continued fraction similar 
to (2.14). Gauss then applies this to relate p(t’) and p(t), etc., bringing us 
back to (2.15) (see [12, X.1, p. 225]). But in another place we find such 
continued fraction decompositions in close conjunction with geometric 
pictures similar to F, and the above (see [12, VIII, pp. 103-105]). Based 
on this kind of evidence, Gauss’ editors decided that he did see the connection 
(see [12, X.2, pp. 105-106]). Much of this is still a matter of conjecture, 
but the fact remains that reduction theory is a powerful tool for finding 
fundamental domains (see [6, Ch. 12]) and that Gauss was aware of some 
of this power. 

Having led the reader on a rather long digression, it is time for us to 
return to the arithmetic-geometric mean. 


Step 3. The Simplest Value 
Let F* = {te F:|t— 1/4|> 1/4, |t + 3/4| > 1/4}. We may picture 
F* as follows. 


F* shaded 


=] =|f2 ) 1/2 | 


Let a, be C* be as usual, and let t € § satisfy k'(t) = b/a. From Lemma 2.3 
we know that p = a/p(t)? is a value of M(a, b). The goal of Step 3 is to 


prove the following lemma. 
LEMMA 2.9. If teF*, then ww is the simplest value of Ma, b). 
Proof. From Lemma 2.3 we know that 

(2.16) a, = pp(2"1)?, by, = pg(2"t)?, nn = 0,1, 2,... 


gives us good sequences converging to p. We need to show that b+, is the 
right choice for (a,b,)*/? for alln > 0 
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The following equivalences are very easy to prove: 


'b 
Panis — Ga St aueg * 8,4) [= Re (24) 20 


Qn+1 


‘ 


‘bh 
Lee bee Sey HBS [= Re (74) cosh: 


Anas 


Recalling the definition of the right choice, we see that we have to prove, 


\ 


Bava b elit 
bala SO sat Re ( al "| > o,andif Re ( vt = 0, then Im (4) > 0. 


Ont 1 yea) n+ 
From (2.16) we see that 


so that we are reduced to proving that if te F*, then for all n > 0, 
Re{k'(2"*'t)) = 0, and if Re(k'((2"*'1)} = 0, then Im(k{2"**1)) > 0. 

Let F , denote the region obtained by translating F, by + 2, + 4, etc. 
The drawing below pictures both F, and F. 


-2 -| e) | Y soe 


Ff, shaded 
F indicated by dashed lines 


Since &’(t) has period 2 and its real part is nonnegative on F, by Lemma 2.8, 
it follows that the real part of k(t) is nonnegative on all of F,. Further- 
more, it is clear that on F,, Re(k'()) == 0 can occur only on OF,. The 
product expansians {2.6) show that k(t) is real when Ret = + 1, so that 
on F,, Re(k’(t)} = 0 can occur only on the boundary semicircles. From the 
periodicity of k(t) we conclude that &‘{t) has positive real part on the 
interior £° of Fy. 

If te F*, then the above drawing makes it clear that 2"*'teF, for 
n> 0 and that 2"*'te F? for n > 1. We thus see that Re(k'(2"t11)) > 0 
for n = 0 unless n = O and 2te€ 0F;. Thus the lemma will be proved once 
we show that Im(k/(2z)} > 0 when t¢ F* and 2t € dF. 

These last two conditions imply that 21 hes on one of the semicircles A 
and B pictured below, 
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-| 0 | 2 


By periodicity, k’ takes the same values on A and B. Thus it suffices to 
0 -1 : 
show that Im(k(21)) > 0 for 2t1e€ A. Since S = f : maps the line 


Reo = | to A, we can write 21 = —1/o, where Reo = 1. Then, using (2.7), 
we obtain 


q-Voy _ x0)? 
AH-I/oyY p(s)? 


Since Reo = 1, the product expansions (2.6) easily show that 


k'(2t) = k(-1/s) = 


Im(7(o)?/p(o)?) > 0, 
which completes the proof of Lemma 2.9. QED 


Step 4. Conclusion of the Proof. 

We can now prove Theorem 2.2. Recall that at the end of Step 1 we 
were left with three tasks: to find all solutions t of k(t) = b/a, to relate 
the values of a/p{t)? thus obtained, and to show that all values of M(a, b) 
arise in this way. 

We are given a,be C* with a # +b and |a| 2b]. We will first find 
to€F, 0 F* such that k'(ty) = b/a. Since { b/a| < 1, Lemma 2.8 gives us 
tT €F, with k(t.) = b/a. Could ty fail to lie in F*? From the definition 
of F*, this only happens when t, lies in the semicircle B pictured below. 


F, shaded 


=l42 ) \/2 
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0 
However, y = ( Jer (4) takes B to the semicircle A. Since k’ is 


—4 1 
invariant under I’,(4), we have k’(yt,.) = k(t) = b/a. Thus, replacing ty by 
YT), We may assume that t) € F, 0 F”. 

It is now easy to solve the first two of our tasks. Since k’ induces a 
bijection §/T',(4) = C — {0, +1}, it follows that all solutions of k(t) = b/a are 
given by t = yt>o, y€I2(4). This gives us the following set of values of 
M(a, b): 


{a/p(yto)” : ye 2(4)} . 


Recalling the statement of Theorem 2.2, it makes sense to look at the 
reciprocals of these values: 


R = {p(yt,)?/a: y eT ,(4)} 


By (2.12), p(yto)? = (cto +d) p(t)” for y = : )e T',(4) < T(2)9. Setting 


u = a/p(to)*, we have 


b 
R = {(cto+d) p(to)’/a: y = is 4 eT 2(4)} 


b 
= {(cto+@/piy = '- i) eT ,(4)}. 


An easy exercise in number theory shows that the bottom rows (c, d) of 
elements of I’,(4) are precisely those pairs (c,d) satisfying GCD(c, d) = 1, 
c = Omod 4 and d = 1 mod 4. We can therefore write 


R = {(cto+@)/p: GCD(c,d) = 1, c=Omod4, d=1mod 4}. 
Then setting 4 = ip/tg gives us 
dic 
QAiy) R= ‘" + a: GCD(c,d) = 1, d=1mod4, c= omod 4} 
p 
Finally, we will show that p and A are the simplest values of M(a, b) 
and M(a+b,a—b) respectively. This is easy to see for p: since ty € F%, 
Lemma 2.9 implies that pp = a/p(t,)? is the simplest value of M(a, b). 
Turning to A, recall from Lemma 2.3 that a = pp(to)? and b = pq(to)’. 
Thus by (2.8) and (2.7), 
: i —1\? 
a + b = w(t)? +4(to)”) = 2p p(2t)” = 2p (=)o(=) ; 


2To 2 
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on —1\? 
7 2 A Oto * 
which implies that 


a+b=APW—1/2—)?, a—b=X|q—1/2t9)’. 


\ 


a—b= (p(t)? — q(to)7} = 2p r(2t»)? 


Hence A is a value of M(a+b,a—b). To see that it is the simplest value, 
we must show that —1/219¢ F* (by Lemma 2.9). Since ty € F2, we have 


0 —1 
2t,€F,. But F, is stable under S = ( 0 


The inclusion F, S F* is obvious, and —1/2t,) € F* follows. This completes 
our first two tasks. 

Our third and final task is to show that (2.17) gives the reciprocals of 
all values of M(a, b). This will finish the proof of Theorem 2.2. So let p’ be a 
value of M(a, b), and let {a,} 9 and {b,} 9 be the good sequences such that 
w = lim a, = lim 6,. Then there is some m such that b,,, is the right 


choice for (a,b,)'/? for all n > m, and thus yp’ is the simplest value of 
M(a,,, 6,,). Since k': F’ + C — {0, +1} is surjective by Lemma 2.7, we can find 
te F’ such that k'(z) = b,/a,. Arguing as above, we may assume that 
te F*. Then Lemma 2.9 shows that p’ = a,,/p(t)? and also that for n > m, 


} SO that —1/2t ,€ Fy. 


(2.18) a, =p par"), b, = pg” ™). 


Let us study a,,-, and 6,,_,. Their sum and product are 2a, and 62 
respectively. From the quadratic formula we see that 


{@n—12 On—1} = {aq + (a2 —b2)47} 
Using (2.9), we obtain 
az, — 6%, = w (p(t) —alt)*) = pi 7r(*, 
so that, again using (2.9), we have 


u’p(t/2)? 
Om + (Am — Be)? = w(a(r)? +r(t)*) = - 
p’q(t/2)? . 
Thus, either 


Ona = W plt/2)’, by-a = HW g(t/2)? oF ayy = pl aft/2)?, by = pw p(c/2). 


54 3. THE ARITHMETIC-GEOMETRIC MEAN OF GAUSS (1984) 
306 D. A. COX 


In the former case, set t, = 1/2. Then from (2.18) we easily see that for 
n>m-— 1, 


(2.19) a, =p pa" "*'t,)?, by = pi g(2?-™**t,)?. 


If the latter case holds, let t, = 1/2 + 1. From (2.7) we see that a,,_, 
= w' p(t,)*, by, = wW g(t,)*, and it also follows easily that p(2"~-"*!1,) 
= p(2"~™t) and q(2"~"**t,) = q(2"~™t) for all n > m. Thus (2.19) holds for 
this choice of t, andn > m — 1. 

By induction, this argument shows that there is t,, € § such that for all 
n>0, 


@, = W' p(2"t,), by = W g(2"t,)? . 


In particular, yp’ = a/p(t,,)? and k’(t,,) = b/a. Thus (y’)~! = p(t,,)?/a is in the 
set R of (2.17). This shows that R consists of the reciprocals of all values 
of M(a, b), and the proof of Theorem 2.2 is now complete. QED 


We should point out that the proof just given, though arrived at 
independently, is by no means original. The first proofs of Theorem 2.2 
appeared in 1928 in [15] and [35]. Geppert’s proof [15] is similar to ours 
in the way it uses the theory of theta functions and modular functions. 
The other proof [35], due to von David, is much shorter; it is a model of 
elegance and conciseness. 

Let us discuss some consequences of the proof of Theorem 2.2. First, the 
formula A = ipt/t) obtained above is quite interesting. We say that t) 
“uniformizes” the simplest value p of M(a, b), where 


a= ppt), 5 = palty)*: 


Writing the above formula as ty = i r, we see how to explicitly compute 


Tt) in terms of the simplest values of M(a,b) and M(a+b,a—b). This is 
especially useful when a > b > 0. Here, if we set c = ./a* — b’, then, 
using the notation of § 1, the simplest values are M(a, b) and M(a, c), so that 


2.20 =e 
(2.20) “0 = Go)” 


A nice example is when a = me: and b = 1. Then c = 1, which implies 


to = i! Thus M(,/2, 1) = ./2/p(i)? = 1/q(i)?. From § 1 we know M(/2, 1) 
= m/6, which gives us the formulas 


3. THE ARITHMETIC-GEOMETRIC MEAN OF GAUSS (1984) 55 


THE ARITHMETIC-GEOMETRIC MEAN 307 


G/n = 27 V2p(i)? = 271/21 4 2e7*§ 4 2e7 4 De +), 


fi 


(2.21) 


a/n = qi? = 1—2e7-"+2e7*—2e7 *+...)?. 


We will discuss the importance of this in § 3. 


Turning to another topic, note that M(a, b) is clearly homogeneous of 
degree 1, ic. if p is a value of M(a,b), then cp is a value of M(ca, cb) 
for ce C*. Thus, it suffices to study M(i, 5) for be C — {0, +1}. Its values 
are given by p = 1/p(t)’ where k(t) = b. Since kh’: H > C — {0, +1} is a 
local biholomorphism, it follows that M(1, b) is a multiple valued holomorphic 
function. To make it single valued, we pull back to the universal cover 
via k’, giving us M(1, k(t). We thus obtain 


M(1,k'(q)) = t/p(t)?. 


This shows that the agM may be regarded as a meromorphic modular form 
of weight — 1. 
Another interesting multiple valued holomorphic function is the elliptic 
wi2 
integral | (1—k’sin?*)" dp. This is a function of keC — {0, +1}. If 
0 
we pull back to the universal cover via k:  —- C — {0, +1} (recall from 


Step 2 that k(t) = r(x)?/p(t)*), then it is well known that 
2 nf 2 
2 | (1—A(x)?sin?)" 7d = pit? 
0 


(see [36, p. 500]). Combining the above two equations, we obtain 


1 = 2_ 2 ar tein 2A 1/2 
MG, kK) ~ Se Wc (1 —K(z)*sin2)~ 17d , 


which may be viewed as a rather amazing generalization of (1.9). 

Finally, let us make some remarks about the set .@ of values of M(a, b), 
where a and b are fixed. If » denotes the simplest value of M(a, b), then 
it can be shown that |u| > |u’| for p’ <4, and |u| is a strict maximum 
if ang(a, b) A x. This may be proved directly from the definitions (see [35]). 
Another proof proceeds as follows. We know that any p’e€.# can be 
written 


(2.22) n= wheto+d), 
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b 
where ty € F, and (? i) € I,(4). Thus it suffices to prove that | ctp + d| > 1 
Cc 


b 
whenever t)€F, and (’ era This is left as an exercise for the 
c 


reader. 

We can also study the accumulation points of M. Since |cty + d| 
is a positive definite quadratic form in c and d, it follows from (2.22) that 
0 eC is the only accumulation point of .4. This is very satisfying once we 
recall from Proposition 2.1 that 0¢ C is the common limit of all non-good 
sequences {a,} 9 and {b,} 9 coming from (2.1). 

The proof of Theorem 2.2 makes one thing very clear: we have now seen 
“an entirely new field of analysis.” However, before we can say that Gauss’ 
prediction of May 30, 1799 has been fulfilled, we need to show that the 
proof given above reflects what Gauss actually did. Since we know from 
Step 2 about his work with the theta functions p(t), q(t) and r(t) and the 
modular function k(t), it remains to see how he applied all of this to the 
arithmetic-geometric mean. 

The connections we seek are found in several places in Gauss’ notes. 
For example, he states very clearly that if 


(2.23) a=ppr)?, b= war)’, 


then the sequences a, = p p(2"t)”, b, = 1 q(2"t)* satisfy the agM algorithm 
(2.1) with as their common limit (see [12, III, p. 385 and pp. 467-468)). 
This is precisely our Lemma 2.3. In another passage, Gauss defines the 
“einfachste Mittel” (simplest mean) to be the limit of those sequences where 
Re(b,.,/a,) > 0 for all n > 0 (see [12, III, p. 477]). This is easily seen to 
be equivalent to our definition of simplest value when ang(a, b) # m. On the 
same page, Gauss then asserts that for te F2, p is the simplest value of 
M(a, b) for a,b as in (2.23). This is a weak form of Lemma 2.9. Finally, 
consider the following quote from [12, VIII, p. 101]: “In order to solve the 
equation “ = A, one sets A? = n/m and takes the agM of m and n; 
let this be p. One further takes the agM of m and ./m? — n’, or, what 
is the same, of m+n and m—n; let this be 4. One then has t = p/Ad. 
This gives only one value of t; all others are contained in the formula 


_ ot — 2Bi 
~ § — 2yti’ 


, 
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where «, B, y, 6 signify all integers which satisfy the equation a6 — 4By = 1.” 
Recall that Ret > 0, so that our 1 is just ti. Note also that the last 
assertion is not quite correct. 

Unfortunately, in spite of these compelling fragments, Gauss never actually 
stated Theorem 2.2. The closest he ever came is the following quote from 
f12, X.1, p. 219]: “The agM changes, when one chooses the negative value for 
one of n’, n’, n’” etc.: however all resulting values are of the following 
form: 


(2.24) —=—+—-.” 


Here, Gauss is clearly dealing with M(m,n) where m > n > 0. The fraction 
I/u in (2.24) is correct: in fact, it can be shown that if the negative value 
of n™ is chosen, and all other choices are the right choice, then the cor- 
responding value w’ of M(m, n) satisfies 


(see [13, p. 140]). So (2.24) is only a very special case of Theorem 2.2. 
There is one final piece of evidence to consider: the 109th entry in 
Gauss’ mathematical diary. It reads as follows: 


Between two given numbers there are always infinitely many means 
both arithmetic-geometric and harmonic-geometric, the observation of 
whose mutual connection has been a source of happiness for us. 


(See [12, X.i, p. 550]. The harmonic-geometric mean of a and b is 
M(a~',b~")~!.) What is amazing is the date of this entry; June 3, 1800, 
a little more than a year after May 30, 1799. We know from §1 that 
Gauss’ first proofs of Theorem 1.1 date from December 1799. So less than 
six months later Gauss was aware of the multiple valued nature of M(a, b) 
and of the relations among these values! One tantalizing question remains: 
does the phrase “mutual connection” refer only to (2.24), or did Gauss have 
something more like Theorem 2.2 in mind? Just how much did he know 
about modular functions as of June 3, 1800? In order to answer these 
questions, we need to examine the history of the whole situation more 
closely. 
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3. HISTORICAL REMARKS 


The main difficulty in writing about the history of mathematics is that 
so much has to be left out. The mathematics we are studying has a richness 
which can never be conveyed in one article. For instance, our discussion 
of Gauss’ proofs of Theorem 1.1 in no way does justice to the complexity 
of his mathematical thought; several important ideas were simplified or 
omitted altogether. This is not entirely satisfactory, yet to rectify such gaps 
is beyond the scope of this paper. As a compromise, we will explore the 
three following topics in more detail: 


A. The history of the lemniscate, 
B. Gauss’ work on inverting lemniscatic integrals, and 


C. The chronology of Gauss’ work on the agM and theta functions. 


A. The lemniscate was discovered by Jacob Bernoulli in 1694. He gives 


the equation in the form 
XxX + yy = a./xx — yy 


(in §1 we assumed that a = 1), and he explains that the curve has “the 
form of a figure 8 on its side, as of a band folded into a knot, or of a 
lemniscus, or of a knot of a French ribbon” (see [2, p. 609]). “Lemniscus” 
is a Latin word (taken from the Greek) meaning a pendant ribbon fastened 


to a victor’s garland. 
1 


More interesting is that the integral | (1—z*)~*/?dz, which gives one- 
10) 


quarter of the arc length of the lemniscate, had been discovered three years 
earlier in 1691! This was when Bernoulli worked out the equation of the 
so-called elastic curve. The situation is as follows: a thin elastic rod is bent 
until the two ends are perpendicular to a given line L. 
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After introducing cartesian coordinates as indicated and letting a denote OA, 
Bernoulli was able to show that the upper half of the curve is given by the 
equation 


x 2 
(3.1) y= { pee 
o ./a* — 2* 
where 0 < x < a (see [2, pp. 567-600)). 
It is convenient to assume that a = J. But as soon as this is done, 
we no longer know how long the rod is. In fact, (3.1) implies that the 
arc length from the origin to a point (x, y) on the rescaled elastic curve is 


ee 1 
| (1—z*)~'dz. Thus the length of the whole rod is 2| (1 —z*)7 */2dz, 
0 0 


which is precisely Gauss’ 6! 

How did Bernoulli get from here to the lemniscate? He was well aware 
of the transcendental nature of the elastic curve, and so he used a standard 
seventeenth century trick to make things more manageable: he sought 
“an algebraic curve... whose rectification should agree with the rectification 
of the elastic curve” (this quote is from Euler [9, XXT, p. 276)). 

Jacob actually had a very concrete reason to be interested in arc length: 
in 1694, just after his long paper on the elastic curve was published, he 
solved a problem of Leibniz concerning the “isochrona paracentrica” (see 
(2, pp. 601-607]). This called for a curve along which a falling weight recedes 
from or approaches a given point equally in equal times. Since Bernoulli’s 
solution involved the arc length of the elastic curve, it was natural for him 
to seek an algebraic curve with the same arc length. Very shortly thereafter, 
he found the equation of the lemniscate (see [2, pp. 608-612]). So we really 
can say that the arc length of the lemniscate was known well before the 
curve itself. 

But this is not the full story. In 1694 Jacob’s younger brother Johann 
independently discovered the lemniscate! Jacob's paper on the isochrona 
paracentrica starts with the differential equation 


(xdx + ydy)/y = (xdy—ydx)/a, 


which had been derived earlier by Johann, who, as Jacob rather bluntly 
points out, hadn’t been able to solve it. Johann saw this comment for the 
first time when it appeared in June 1694 in Acta Eruditorum. He took up 
the challenge and quickly produced a paper on the isochrona paracentrica 
which gave the equation of the lemniscate and its relation to the elastic 
curve. This appeared in Acta Eruditorum in October 1694 (see [3, pp. 119- 


60 3. THE ARITHMETIC-GEOMETRIC MEAN OF GAUSS (1984) 
312 D. A. COX 


122]), but unfortunately for Johann, Jacob’s article on the lemniscate appeared 
in the September issue of the same journal. There followed a bitter priority 
dispute. Up to now relations between the brothers had been variable, 
sometimes good, sometimes bad, with always a strong undercurrent of com- 
petition between them. After this incident, amicable relations were never 
restored. (For details of this controversy, as well as a fuller discussion of 
Jacob’s mathematical work, see [18].) 

We need to mention one more thing before going on. Near the 
end of Jacob’s paper on the lemniscate, he points out that the y-value 


x 
| z*(a*—z*)~'/2dz of the elastic curve can be expressed as the difference 
0 


of an arc of the ellipse with semiaxes a,/2 and a, and an arc of the 
lemniscate (see [2, pp. 611-612]). This observation is an easy consequence 
of the equation 


s 2d x 2d x 2 ay Af2 
(3.2) at | aoa =| (SS) e&. 
o (at — 2")! o (a* — 24)! 0 \e —2 


What is especially intriguing is that the ratio af 2: 1, so important in Gauss’ 
observation of May 30, 1799, was present at the very birth of the lemniscate. 

Throughout the eighteenth century the elastic curve and the lemniscate 
appeared in many papers. A. lot of work was done on the integrals 


a! 1 
| (1—2*)" /2dz and | z*(1—z*)~1/?dz. For example, Stirling, in a work 
ce] 0 

written in 1730, gives the approximations 


/ 
| = 1.31102877714605987 
0 


1 


ii z dz _ __59907011736779611 
a 
(see [31, pp. 57-58]). Note that the second number doubled is 
1.19814023473559222, which agrees with M(./2, 1) to sixteen decimal places. 
Stirling also comments that these two numbers add up to one half the 
circumference of an ellipse with aft and 1 as axes, a special case of 
Bernoulli’s observation (3.2). 

Another notable work on the elastic curve was Euler’s paper “De miris 
xxdx 


fixe 


contentae” 


proprietatibus curvae elasticae sub equatione y = | 
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which appeared posthumously in 1786. In this paper Euler gives approxima- 
tions to the above integrals (not as good as Stirling’s) and, more importantly, 
proves the amazing result that 


; | 1 dz | 1 g*dz 1 
3. a =F 
Go) o/l—2? o.f/i—z* 4 

(see [9, XXI, pp. 91-118]). Combining this with Theorem 1.1 we see that 


gt 
M(/2 1) = | a 


1 
o/l—-2z 
so that the coincidence noted above has a sound basis in fact. 

We have quoted these two papers on the elastic curve because, as we 
will see shortly, Gauss is known to have read them. Note that each paper 
has something to contribute to the equality M(./2, 1) = 2/&: from Stirling, 
we pet the ratio Sf2: 1, and from Euler we get the idea of using an 
equation like (3.3). 

Unlike the elastic curve, the story of the lemniscate in the eighteenth 
century is well known, primarily because of the key role it played in the 
development of the theory of elliptic integrals. Since this material is thoroughly 
covered elsewhere (see, for example, [1, Ch. 1-3], [8, pp. 470-496], [19, § 1-§ 4] 
and [21, § 19.4]), we will mention only a few highlights. One early worker 
was C. G. Fagnano who, following some ideas of Johann Bernoulli, studied 
the ways in which arcs of ellipses and hyperbolas can be related. One result, 
known as Fagnano’s Theorem, states that the sum of two appropriately 
chosen arcs of an ellipse can be computed algebraically in terms of the 
coordinates of the points involved. He also worked on the lemniscate, 
starting with the problem of halving that portion of the arc length of the 
lemniscate which lies in one quadrant. Subsequently he found methods for 
dividing this arc length into m equal pieces, where n = 2", 3-2” or 5+ 2™. 
These researches of Fagnano’s were published in the period 1714-1720 in an 
obscure Venetian journal and were not widely known. In 1750 he had his 
work republished, and he sent a copy to the Berlin Academy. It was given 
to Euler for review on December 23, 1751. Less than five weeks later, on 
January 27, 1752, Euler read a paper giving new derivations for Fagnano’s 
results on elliptic and hyperbolic arcs as well as significantly new results on 
lemniscatic ares. By 1753 he had a general addition theorem for lemniscatic 
integrals, and by 1758 he had the addition theorem for elliptic integrals 
(see [9, XX, pp. VII-VIII]). This material was finally published in 1761, 


ri 
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and for the first time there was a genuine theory of elliptic integrals. For 
the next twenty years Euler and Lagrange made significant contributions, 
paving the way for Legendre to cast the field in its classical form which 
we glimpsed at the end of § 1. Legendre published his definitive treatise on 
elliptic integrals in two volumes in 1825 and 1826. The irony is that in 
1828 he had to publish a third volume describing the groundbreaking papers 
of Abel and Jacobi which rendered obsolete much of his own work 
(see [23]). 

An important problem not mentioned so far is that of computing tables 
of elliptic integrals. Such tables were needed primarily because of the many 
applications of elliptic integrals to mechanics. Legendre devoted the entire 
second volume of his treatise to this problem. Earlier Euler had computed 
these integrals using power series similar to (1.8) (see also [9, XX, pp. 21-55]), 
but these series often converged very slowly. The real breakthrough came 
in Lagrange’s 1785 paper “Sur une nouvelle méthode de calcul intégral” 
(see [22, pp. 253-312]). Among other things, Lagrange is concerned with 
integrals of the form 


| M dy 
(3.4) peer h. an 
J (1 +p7y) (L+q7y’) 


where M is a rational function of y* and p > q > 0. He defines sequences 
Ps P's P's 5 Us 7's Q, » as follows: 


p = p + (p?-9?)'” .¢' = p — (’-9@’)*”, 
(3.5) 
p’ = p’ + (@? -q?)'? ,.q" = p — (p?-q”)'”, 


and then, using the substitution 


: a y((1+p?y’) (1+q?y*))*/2 
(3.6) y= — 

he shows that 

(3.7) ((1 + p?y?) (1+?) */2dy = (1+p2y?) (1+q?y2))- dy’. 


Two methods of approximation are now given. The first starts by 
observing that the sequence p, p’, p”,... approaches +00 while q, q’, q’,... 
approaches 0. Thus by iterating the substitution (3.6) in the integral of (3.4), 
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one can eventually assume that g = 0, which gives an easily computable 
integral. The second method consists of doing the first backwards: from (3.5) 
one easily obtains 


p=(pt+qv2, g=(pa)”. 


Lagrange then observes that continuing this process leads to sequences 
D's Ds Pr "Dy G's Gs ‘Gs GQ, Which converge to a common limit (see [22, 
p. 271)}). Hence iterating (3.6) allows one to eventually assume p = q, 
again giving an easily computable integral. 

So here we are in 1785, staring at the definition of the arithmetic- 
geometric mean, six years before Gauss’ earliest work on the subject. By 
setting py = tand, one obtains 


(1+ p?y?} (L+q?y?))" dp = (p?cos*+q’sin2)" /*#do , 


so that (3.6) and (3.7) are precisely (1.5) and (1.6) from the proof of The- 
orem 1.1. Thus Lagrange not only could have defined the agM, he could 
have also proved Theorem 1.1 cffortlessly. Unfortunately, none of this 
happened; Lagrange never realized the power of what he had discovered. 

One question emerges from all of this: did Gauss ever see Lagrange’s 
article? The library of the Collegium Carolinum in Brunswick had some of 
Lagrange’s works (see [4, p. 9]) and the library at Gottingen had an 
extensive collection (see [12, X.2, p. 22]). On the other hand, Gauss, in 
the research announcement of his 1818 article containing the proof of 
Theorem 1.1, claims that his work is independent of that of Lagrange and 
Legendre (see [12, IIL, p. 360]). A fuller discussion of these matters is in 
[12, X.2, pp. 12-22]. Assuming that Gauss did discover the agM independently, 
we have the amusing situation of Gauss, who anticipated so much in Abel, 
Jacobi and others, himself anticipated by Lagrange. 

The elastic curve and the lemniscate were equally well known in the 
eighteenth century. As we will soon see, Gauss at first associated the integral 


fans with the elastic curve, only later to drop it in favor of the 


lemniscate. Subsequent mathematicians have followed his example. Today, the 
elastic curve has been largely forgotten, and the Jemniscate has suffered the 
worse fate of being relegated to the polar coordinates section of calculus 
books. There it sits next to the formula for are length in polar coordinates, 
which can never be applied to the lemniscate since such texts know nothing 
of elliptic integrals. 
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B. Our goal in describing Gauss’ work on the lemniscate is to learn 
more of the background to his observation of May 30, 1799. We will see 
that the lemniscatic functions played a key role in Gauss’ development of 
the arithmetic-geometric mean. 

Gauss began innocently enough in September 1796, using methods of 
Euler to find the formal power series expansion of the inverse function of 


first Ja-2y mas and then more generally [a-strax (see [12, X.1, 


p. 502]). Things became more serious on January 8, 1797. The Slst entry 
in his mathematical diary, bearing this date, states that “I have begun to 


investigate the elastic curve depending on faqs Notes written 


at the same time show that Gauss was reading the works of Euler and 
Stirling on the elastic curve, as discussed earlier. Significantly, Gauss later 
struck out the word “elastic” and replaced it with “lemniscatic” (see [12, X.1, 
pp. 147 and 510)). 


Gauss was strongly motivated by the analogy to the circular functions. 
1 


For example, notice the similarity between &/2 - | (1—z*)~1/2dz and 
10) 


1 
Tj 2 = | (1—z?) ‘/?dz. (This similarity is reinforced by the fact that many 
0 


eighteenth century texts used © to denote m — see [12, X.2, p. 33].) 
Gauss then defined the lemniscatic functions as follows: 


sinlemn (| (1s) "ar a 
0 
coslemn (0 — | (124 as) = 
0 
(see [12, III, p. 404]). Gauss often used the abbreviations slo and clo 


for sinlemn } and coslemn $ respectively, a practice we will adopt. From 
Euler’s addition theorem one easily obtains 


(3.8) sl + cl? + sl? cl’ = 1 
, _ slodd’ tsi ddd 
2) Ko+0) = 1 aodddodd 


(see [12, X.1, p. 147]). 
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Other formulas can now be derived in analogy with the trigonometric 
functions (see [25, pp. 155-156] for a nice treatment), but Gauss went much, 
much farther. A series of four diary entries made in March 1797 reveal the 
amazing discoveries that he made in the first three months of 1797. We will 
need to describe these results in some detail. 

Gauss started with Fagnano’s problem of dividing the Jemniscate into n 
equal parts. Since this involved an equation of degree n”, Gauss realized that 
most of the roots were complex (see [12, X.1, p. 515]). This led him to 
define st @ and cl $ for complex numbers >. The first step is to show that 


sliy) = isly, cl{iy) = L/elfy), 


(the first follows from the change of variable z = iz’ in [a-s71¢, 


and the second follows from (3.8)). Then (3.9) implies that 


slx +islyelxcly 


sl(x +i¥) = ———_—__— 
au ey —isixslyclx 


(see [12, X.1, p. 154). 

It follows easily that sl is doubly periodic, with periods 2@ and 2:6. 
The zeros and poles of st are also easy to determine; they are given by 
b = (mt+in)® and b = ((2m—1)+ i(2n— 1)} (0/2), m, n © Z, respectively. Then 
Gauss shows that sl can be written as 


_ MO) 
N(o) 


where M(o) and N() are entire functions which are doubly indexed 
infinite products whose factors correspond to the zeros and poles respect- 
ively (see (12, X.1, pp. 153-155]). In expanding these products, Gauss 
writes down the first examples of Eisenstein series (see [12, X.1, pp. 515-516]). 
He also obtains many identities involving M() and N(), such as 


sl b 


(3.10) N(2) = M(o)* + N(o)* 


(see [12, X.1, p. 157]). Finally, Gauss notices that the numbers N(®) 
and e”? agree to four decimal places (see [12, X.1, p. 158]). He comments 
that a proof of their equality would be “a most important advancement 
of analysis” (see 12, X.1, p. 517). 

Besides being powerful mathematics, what we have here is almost a 
rehearsal for what Gauss did with the arithmetic-geométric mean: the 
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observation that two numbers are equal, the importance to analysis of 
proving this, and the passage from real to complex numbers in order to 
get at the real depth of the subject. Notice also that identities such as 
(3.10) are an important warm-up to the theta function identities needed 
in § 2. 

Two other discoveries made at this time require comment. First, only 
a year after constructing the regular 17-gon by ruler and compass, Gauss 
found a ruler and compass construction for dividing the lemniscate into 
five equal pieces (see [12, X.1, p. 517]). This is the basis for the remarks 


concerning | (1—x*)~1/*dx made in Disquisitiones Arithmeticae (see [11, 


§ 335]). Second, Gauss discovered the complex multiplication of elliptic 
functions when he gave formulas for sl(1+i)b, N(1+i)o, etc. (see [12, III, 
pp. 407 and 411]). These discoveries are linked: complex multiplication on 
the elliptic curve associated to the lemniscate enabled Abel to determine 
all n for which the lemniscate can be divided into n pieces by ruler and 
compass. (The answer is the same as for the circle! See [28] for an 
excellent modern account of Abel’s theorem.) 

After this burst of progress, Gauss left the lemniscatic functions to work 
on other things. He returned to the subject over a year later, in July 1798, 
and soon discovered that there was a better way to write sl > as a quotient 


: F : : . {% 
of entire functions. The key was to introduce the new variable s = sin (= ®). 


Since sl d has period 2, it can certainly be written as a function of s. 
By expressing the zeros and poles of slo in terms of s, Gauss was able 
to prove that 


_ PC) 
Ol)’ 


| 6 4s? 4s? 


4s? 4s? 


(see [12, ITI, pp. 415-416]). Relating these to the earlier functions M(o) 
and N(), Gauss obtains (letting 6 = 0) 


M(Wa) =e"? P(yés) , 


sl 


where 
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N(Wa) = e"? OWE), 


(see [12, IH, p. 416]). Notice that N(®) = e*? is an immediate consequence 
of the second formula. 

Many other things were going on at this time. The appearance of 2/6 
sparked Gauss’ interest in this ratio. He found several ways of expressing 
&/n, for example 


@ 4/2 ht. 37 7S 71 
ee) 2 = ¥?(14(5) ;+(3) ++ (3) oo: 
and he computed @/n to fifteen decimal places (see [12, X.1, p. 169]). 
He also returned to some of his earlier notes and, where the approximation 


1 
2 | z7(1—z*)" /2dz = 1.198 appears, he added that this is /@ (see [12, X-1, 
0 


pp. 146 and 150]). Thus in July 1798 Gauss was intimately familiar with 


the right-hand side of the equation M(/2, 1) = x/. Another problem he 
studied was the Fourier expansion of si >. Here, he first found the numerical 
value of the coefficients, ie. 


sl yO = .95500599 sin Wn — .04304950 sin 3 Wn + ..., 
and then he found a formula for the coefficients, obtaining 


An 4n 


aera we"? 4 en ee @(e3%?? 4 e382) 


sin 3 ya +... 
see (12, X.1, p. 168 and TL, p. 417)). 

The next breakthrough came in October 1798 when Gauss computed the 
Fourier expansions of P(d) and Q(o). As above, he first computed the 
coefficients numerically and then tried to find a general formula for them. 
Since he suspected that numbers like e~", e~*7, etc, would be involved, 
he computed several of these numbers (see [12, III, pp. 426-432]). The final 
formulas he found were 


Pda) = 


25/4 (n/5) 1/2 Ga sin yx — e794 sin 3 Yn + e7 75%4 sin Sa — =) 


(3.12) Qa) = 


27 V4(n/Gs)t/? (: + 2e7* cos 2x + 2e74** cos 4m + 2e7%* cos Ga + :) 
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(see [12, X.1, pp. 536-537]). A very brief sketch of how Gauss proved 
these formulas may be found in [12, X.2, pp. 38-39]. 

These formulas are remarkable for several reasons. First, recall the theta 
functions ©, and @,: 


©,(z,q) = 2q'/* sin z — 2q°/* sin 3z + 2q?>/* sin 5z — ... 
(3.13) 
©3(z,q) = 1 + 2q cos 2z + 2q* cos 4z + 2q? cos 6z + ... 


(see [36, p. 464]). Up to the constant factor 2 ‘/*(n/@)/*, we see that 
P(W&) and Q(\) are precisely ©,(W2,e~") and ©,(x,e~*) respectively. 
Even though this is just a special case, one can easily discern the general 
form of the theta functions from (3.12). (For more on the relation between 
theta functions and sl 9, see [36, pp. 524-525)). 

Several interesting formulas can be derived from (3.12) by making specific 
choice for y. For example, if we set y = 1, we obtain 


Jon = 27414207" 420-487 4.2097 4._.), 


Also, if we set y = 1/2 and use the nontrivial fact that P(@/2) = Q(6/2) 
= 2~*/4 (this is a consequence of the formula Q(2o) = P()* + O(o)* — see 
(3.10)), we obtain 


[65/1 = Qe 94 4 @7 4 4. 2584 4.) 
(3.14) 


/O/n = 1 — 2e7* + 2e°** — 26° + 


Gauss wrote down these last two formulas in October 1798 (see [12, III, 
p. 418]). We, on the other hand, derived the first and third formulas as 
(2.21) in §2, only after a very long development. Thus Gauss had some 
strong signposts to guide his development of modular functions. 

These results, all dating from 1798, were recorded in Gauss’ mathematical 
diary as the 9ist and 92nd entries (in July) and the 95th entry (in 
October). The statement of the 92nd entry is especially relevant: “I have 
obtained most elegant results concerning the lemniscate, which surpasses 
all expectation—indeed, by methods which open an entirely new field to us” 
(see [12, X.1, p. 535]). There is a real sense of excitement here; instead 
of the earlier “advancement of analysis” of the 63rd entry, we have the much 
stronger phrase “entirely new field.” Gauss knew that he had found 
something of importance. This feeling of excitement is confirmed by the 
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95th entry: “A new field of analysis is open before us, that is, the investigation 
of functions, etc.” (see [12, X.1, p. 536]). It’s as if Gauss were so enraptured 
he didn't even bother to finish the sentence. 

More importantly, this “new field of analysis” is clearly the same 
“entirely new field of analysis” which we first saw in § 1 in the 98th entry. 
Rather than being an isolated phenomenon, it was the culmination of years 
of work. Imagine Gauss’ excitement on May 30, 1799: this new field which 
he had seen grow up around the lemniscate and reveal such niches, all of a 
sudden expands yet again to encompass the arithmetic-geometric mean, a 
subject he had known since age 14. All of the powerful analytic tools he had 
developed for the lemniscatic functions were now ready to he applied to 
the agM. 


C. In studying Gauss’ work on the agM, it makes sense to start by 


asking where the observation M(/2, 1) = n/@ came from. Using what we 
have learned so far, part of this question can now be answered: Gauss was 
very familiar with 2/@, and from reading Stirling he had probably seen the 
ratio fds 1 associated with the lemniscate. (In fact, this ratio appears in most 
known methods for constructing the lemniscate—see [24, pp. 111-117].) We 
have also seen, in the equation N(®) = e”*, that Gauss often used numerical 
calculations to help him discover theorems. But while these facts are 
enlightening, they still leave out one key ingredient, the idea of taking the 


agM of J2 and 1. Where did this come fram? The answer is that every 
great mathematical discovery is kindled by some intuitive spark, and in our 
case, the spark came on May 30, 1799 when Gauss decided to compute 
M(/2, D. 

We are still missing one piece of our picture of Gauss ai this time: 
how much did he know about the agM? Unfortunately, this is a very 
difficult question to answer. Only a few scattered fragments dealing with the 
agM can be dated before May 30, 1799 (see [12, X.1, pp. 172-173 and 260]). 
As for the date 1791 of his discovery of the agM, it comes from a letter 
he wrote in 1816 (see [12, X.1, p. 247]), and Gauss is known to have 
been sometimes wrong in his recollections of dates. The only other knowledge 
we have about the agM in this period is an oral tradition which holds 
that Gauss knew the relation between theta functions and the agM in 1794 
{see (12, II], 493]). We will soon see that this claim is not as outrageous 
as one might suspect. 

It is not our intention to give a complete account of Gauss’ work on 
the agM. This material is well covered in other places (see [10], [12, X.2, 
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pp. 62-114], [13], [14] and [25]—the middle three references are especially 

complete), and furthermore it is impossible to give the full story of what 

happened. To explain this last statement, consider the following formulas: 
B + (1/4)B? + (9/64)B> + ... = (2214/2 +229/74+..)? = r?, 

(3.15) 


a Pe , vs 4 
‘wa0 = 1 + (1/4)B? + (9/64)B* + ... , 


where B = (1—(b/a)?)'/*. These come from the first surviving notes on the 
agM that Gauss wrote after May 30, 1799 (see [12, X.1, pp. 177-178)). 
If we seta = 1 andb =k’ = ,/1—k?, then B = k, and we obtain 


wie = 1 + (1/4k? + (9/64)k* + ... 
(3.16) 
_ A 2a ae se Ps 
MA, k) 


The first formula is (1.8), and the second, with z = e™?, follows easily 
from what we learned in §2 about theta functions and the agM. Yet the 
formulas (3.15) appear with neither proofs nor any hint of where they came 
from. The discussion at the end of §1 sheds some light on the bottom 
formula of (3.15), but there is nothing to prepare us for the top one. 

It is true that Gauss had a long-standing interest in theta functions, 
going back to when he first encountered Euler’s wonderful formula 


s (17 lon? +ny/2 = I] (=y.. 


n=—c n=1 


The right-hand side appears in a fragment dating from 1796 (see [12, X.1, 
p. 142]), and the 7th entry of his mathematical diary, also dated 1796, gives a 
continued fraction expansion for 


1-—-2+8- 64+... 


Then the 58th entry, dated February 1797, generalizes this to give a con- 
tinued fraction expansion for 


le ee oe BP a PP 


(see [12, X.1, pp. 490 and 513]). The connection between these series and 
lemniscatic functions came in October 1798 with formulas such as (3.14). 
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This seems to have piqued his interest in the subject, for at this time he 
also set himself the problem of expressing 


(3.17) . Lex xPt xo txt... 


as an infinite product (see (12, X.1, p. 538]). Note also that the first 
formula of (3.14) gives r with z = e7%. 

Given these examples, we can conjecture where (3.15) came from. Gauss 
could easily have defined p, g and r in general and then derived identities 
(2.8)-(2.9) (recall the many identities obtained in 1798 for P() and Q()—see 
(3.10) and [12, INI, p. 410]). Then (3.15) would result from noticing that 
these identities formally satisfy the agM algorithm, which is the basic content 
of Lemma 2.3. This conjecture is consistent with the way Gauss initially 
treated z as a purely formal variable (the interpretation z = e~*? was only 
to come later—see (12, X.1, pp. 262-263 and X.2, pp. 65-66)]). 

The lack of evidence makes it impossible to verify this or any other 
reasonable conjecture. But one thing is now clear: in Gauss’ observation 
of May 30, 1799, we have not two but three distinct streams of his thought 
coming together. Soon after (or simultaneous with) observing that M(./2, 1) 
= n/&, Gauss knew that there were inimate connections between lemniscatic 
functions, the agM, and theta functions. The richness of the mathematics we 
have seen is in large part due to the many-sided nature of this confluence. 

There remain two items of unfinished business. From § 1, we want to deter- 
mine more precisely when Gauss first proved Theorem 1.1. And recall from § 2 
that on June 3, 1800, Gauss discovered the “mutual connection” among 
the infinitely many values of M(a, b). We want to see if he really knew the 
bulk of §2 by this date. To answer these questions, we will briefly 
examine the main notebook Gauss kept between November 1799 and 
July 1800 (the notebook is “Scheda Ac” and appears as pp. 184-206 in 
[12, X.1). 

The starting date of this notebook coincides with the 100th entry of 
Gauss’ mathematical diary, which reads “We have uncovered many new 
things about arithmetic-geometric means” (see [12, X.1, p. 544]). After several 
pages dealing with geometry, one all of a sudden finds the formula (3.11) 
for @/nx. Since Gauss knew (3.15) at this time, we get an immediate proof 
of M(./2, 1) = 1x/@. Gauss must have had this in mind, for otherwise why 
would he so carefully recopy a formula proved in July 1798? Yet one could 
also ask why such a step is necessary: isn’t Theorem 1.1 an immediate 
consequence of (3.15)? Amazingly enough, it appears that Gauss wasn’t yet 
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aware of this connection (see [12, X.1, p. 262]). Part of the problem is 
that he had been distracted by the power series, closely related to (3.15), 
which gives the arc length of the ellipse (see [12, X.1, p. 177]). This 
distraction was actually a bonus, for an asymptotic formula of Euler’s for 
the arc length of the ellipse led Gauss to write 


_ G2 -e er =.) 


where x = k~}, and z and k are as in (3.16) (see [12, X.1, pp. 186 and 
268-270]). He was then able to show that the power series on top was 
(k M(1, k’))~', which implies that 


ge apt x 
eRe 5 aL, 


(see [12, X.1, pp. 187 and 190]). Letting z = e*/*, we obtain formulas 
similar to (2.20). More importantly, we see that Gauss is now in a position 
to uniformize the agM;; z is no longer a purely formal variable. 

In the process of studying (3.18), Gauss also saw the relation between 
the agM and complete elliptic integrals of the first kind. The formula 


l 2 . 2 25,2)\\— 1/2 
M(U,k) © ade) dx 


follows easily from [12, X.1, p. 187], and this is trivially equivalent to (1.7). 
Furthermore, we know that this page was written on December 14, 1799 
since on this date Gauss wrote in his mathematical diary that the agM 
was the quotient of two transcendental functions (see (3.18)), one of which 
was itself an integral quantity (see the 101st entry, [12, X.1, 544]). Thus 
Theorem 1.1 was proved on December 14, 1799, nine days earlier than our 
previous estimate. 

Having proved this theorem, Gauss immediately notes one of its 
corollaries, that the “constant term” of the expression (1+ cos?d)~?/ is 
M(./1+u, 1)~* (see [12, X.1, p. 188]). By “constant term” Gauss means 
the coefficient A in the Fourier expansion 


(L1+ucos*p)~ 1/2? = A + A’cosd + A” cos2o +... . 


2 n/2 
Since A is the integral - | (1+ cos*)~1/*do, the desired result follows 
10) 


from Theorem 1.1. This interpretation is important because these coefficients 
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are useful in studying secular perturbations in astronomy (see [12, X.1, 
pp. 237-242]). It was in this connection that Gauss published his 1818 paper 
[12, If], pp. 331-355] from which we got our proof of Theorem 1.1. 

What Gauss did next is unexpected: he used the agM to generalize the 
lemniscate functions to arbitrary elliptic functions, which for him meant inverse 
functions of elliptic integrals of the form 


{o +p?sin2) do = {co — x?) (14 p2x?2))" V2dx , 


Note that = 1 corresponds to the lemniscate. To start, he first set 
= tan y, 


COS V T™ COS V 


, 


=, = 
M(1, cosy) ’ M(1, sinv) 


and finally 
n oy n M(, ee) 
(3.19) z= exp| — ay exp| — 2° M(.sinv | 
us wf T(o) 
Then he defined the elliptic function S{$) by S(o) = We) where 


TOpo) = 2n7¥2./MG, cosy) (24sinyn—2??sin3yr +...) 
(3.20) 


Wwe) = M1, cosv) (1 + 2z?cos2n+ 2z%cos4yn +...) 


(see (12, X.1, pp. 194-195 and 198]). In the pages that follow, we find 
the periods 24 and 2i6', the addition formula, and the differential equation 
connecting S(o} to the above elliptic integral. Thus Gauss had a complete 
iheory of elliptic functions. 

In general, there are two basic approaches to this subject. One involves 
direct inversion of the elliptic integral and requires a detailed knowledge of 
the associated Riemann surface (see [17, Ch. VII])}. The other more common 
approach defines elliptic functions as certain series ($$-functions) or quotients 
of series (theta functions). The difficulty is proving that such functions invert 
all elliptic integrals. Classically, this uniformization problem is solved by 
studying a function such as k(t)? (see [36, § 20.6 and § 21.73]) or j(t) (as in 
most modern texts—see [30, §4.2]). Gauss uses the agM to solve this 
problem: (3.19) gives the desired uniformizing parameter! (In this connection, 
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the reader should reconsider the from [12, VIII, p. 101] given near 
the end of § 2.) 

For us, the most interesting aspect of what Gauss did concerns the 
functions T and W. Notice that (3.20) is a direct generalization of (3.12); 
in fact, in terms of (3.13), we have 


T(W®) = p~*? \/M(1, cosv) © ,(Wx, 27), 


WW) = ./M(1, cosv) @, (Wx, 2”). 


Gauss also introduces T(@/2—) and W(®/2—®), which are related to the 
theta functions @, and ©, by similar formulas (see [12, X.1, pp. 196 and 
275]). He then studies the squares of these functions and he obtains 
identities such as 


2©3(0, 2*) O3(20, z*) = ©3(6, 2’)? + Oa(d, 27)” 


(this, of course, is the modern formulation—see [12, X.1, pp. 196 (Eq. 14) 
and 275]). When o =' 0, this reduces to the first formula 


p(t)? + g(t)? = 2p(21)? 


of (2.8). The other formulas of (2.8) appear similarly. Gauss also obtained 
product expansions for the theta functions (see [12, X.1, pp. 201-205]). 
In particular, one finds all the formulas of (2.6). These manipulations yielded 
the further result that 


L+z+29 4 2827/9 4+... — [] (1-z)"'(1—z?), 
n=1 


solving the problem he had posed a year earlier in (3.17). 

From Gauss’ mathematical diary, we see that the bulk of this work was 
done in May 1800 (see entries 105, 106 and 108 in [12, X.1, pp. 546-549)). 
The last two weeks were especially intense as Gauss realized the special 
role played by the agM. The 108th entry, dated June 3, 1800, announces 
completion of a general theory of elliptic functions (“sinus lemniscatici univer- 
salissime accepti”). On the same day he recorded his discovery of the “mutual 
connection” among the values of the agM! 

This is rather surprising. We’ve seen that Gauss knew the basic identities 
(2.6), (2.8) and (2.9), but the formulas (2.7), which tell us how theta functions 
behave under linear fractional transformations, are nowhere to be seen, nor 
do we find any hint of the fundamental domains used in § 2. Reading this 
notebook makes it clear that Gauss now knew the basic observation of 
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Lemma 2.3 that theta functions satisfy the agM algorithm, but there is no 
way to get from here to Theorem 2.2 without knowing (2.7). It is not 
until 1805 that this material appears in Gauss’ notes (see [12, X.2, pp. {01- 
103]}. Thus some authors, notably Markushevitch [25], have concluded that 
on June 3, 1800, Gauss had nothing approaching a proof of Theorem 2.2. 

Schlesinger, the last editor of Gauss’ collected works, feels otherwise. 
He thinks that Gauss knew (2.7) at this time, though knowledge of the 
fundamental! domains may have not come until 1805 (see [12, X.2, p. 106)). 
Schlesinger often overestimates what Gauss knew about modular functions, 
but in this case I agree with him. As evidence, consider pp. 287-307 in 
[12, X.1]. These reproduce twelve consecutive pages from a notebook written 
in 1808 (see [12, X.1, p. 322]), and they contain the formulas (2.7), a clear 
statement of the basic observation of Lemma 2.3, the infinite product 
manipulations described above, and the equations giving the division of the 
agM into 3, 5 and 7 parts {in analogy with the division of the lemniscate). 
The last item is especially interesting because it relates to the second half 
of the 108th entry: “Moreover, in these same days, we have discovered the 
principles according to which the agM series should be interpolated, so as 
to enable us to exhibit by algebraic equations the terms in a given pro- 
gression pertaining to any rational index” (see [12, X.1, p. 548]). There is no 
other record of this in 1800, yet here it is in 1808 resurfacing with other 
material (the infinite products) dating from 1800. Thus it is reasonable to 
assume that the rest of this material, including (2.7), also dates from 1800. 
Of course, to really check this conjecture, one would have to study the 
original documents in detail. 

Given all of (2.6}(2.9), it is still not clear where Gauss got the basic 
insight that M(a,b) is a multiple valued function. One possible source of 
inspiration is the differential equation (1.12) whose solution (1.13) suggests 
linear combinations similar to those of Theorem 2.2. We get even closer 
to this theorem when we consiser the periods of S(¢): 


m int 
rely = CON (saa cosv) | Mi, —] 
where m, n are even integers. Gauss’ struggles during May 1800 to understand 
the imaginary nature of these periods (see [12, X.2, pp. 70-7L]) may have 
influenced his work on the agM. (We should point out that the above 
comments are related: Theorem 2.2 can be proved by analyzing the mono- 
dromy group—I’,(4) in this case—of the differential equation (1.12).) On the 
other hand, Geppert suggests that Gauss may have taken a completely different 
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route, involving the asymptotic formula (3.18), of arriving at Theorem 2.2 
(see [14, pp. 173-175]). We will of course never really know how Gauss 
arrived at this theorem. 

For many years, Gauss hoped to write up these results for publication. 
He mentions this in Disquisitiones Arithmeticae (see [11, § 335]) and in the 
research announcement to his 1818 article (see [12, III, p. 358]). Two 
manuscripts written in 1800 (one on the agM, the other on lemniscatic 
functions) show that Gauss made a good start on this project (see [12, III, 
pp. 360-371 and 413-415]). He also periodically returned to earlier work and 
rewrote it in more complete form (the 1808 notebook is an example of 
this). Aside from the many other projects Gauss had to distract him, it is 
clear why he never finished this one: it was simply too big. Given his 
predilection for completeness, the resulting work would have been enormous. 
Gauss finally gave up trying in 1827 when the first works of Abel and 
Jacobi appeared. As he wrote in 1828, “I shall most likely not soon prepare 
my investigations on transcendental functions which I have had for many 
years—since 1798—because I have many other matters which must be cleared 
up. Herr Abel has now, I see, anticipated me and relieved me of the burden 
in regard to one third of these matters, particularly since he carried out all 
developments with great concision and elegance” (see [12, X.1, p. 248]). 

The other two thirds “of these matters” encompass Gauss’ work on the 
agM and modular functions. The latter were studied vigorously in the 
nineteenth century and are still an active area of research today. The agM, 
on the other hand, has been relegated to the history books. This is not 
entirely wrong, for the history of this subject is wonderful. But at the same 
time the agM is also wonderful as mathematics, and this mathematics 
deserves to be better known. 
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Synopsis: 

In this paper, brothers Jonathan and Peter Borwein present a review of the 
recently discovered (as of 1984) quadratically convergent formulas for 7 and ele- 
mentary functions (including some new formulas of their own), complete with a 
rigorous derivation of all the requisite mathematics. The paper is thus an excellent 
self-contained tutorial on the theory of the arithmetic-geometric mean, quadrati- 
cally convergent algorithms (including Newton’s algorithm for computing square 
roots and roots of polynomials), and how these concepts lead to fast algorithms for 
am and elementary functions. They do this without needing to venture into incom- 
plete elliptic integrals and Landen transforms, which were used to various degrees 
by earlier writers. 

This paper has particular significance for the editors of this volume, as its 
appearance in SIAM Review, a publication mostly read in the applied mathematics 
and high-performance computing communities, led to Bailey becoming interested 
in these topics and joining with Jonathan and Peter Borwein in a multi-decade 
collaboration. 
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THE ARITHMETIC-GEOMETRIC MEAN AND FAST COMPUTATION OF 
ELEMENTARY FUNCTIONS* 


J. M. BORWEIN# anv P. B. BORWEINT 


Abstract. We produce a self contained account of the relationship between the Gaussian arithmetic- 
geometric mean iteration and the fast computation of elementary functions. A particularly pleasant algorithm 
for 7 is one of the by-products. 


Introduction. It is possible to calculate 2” decimal places of w using only » iterations 
of a (fairly) simple three-term recursion. This remarkable fact seems to have first been 
explicitly noted by Salamin in 1976 [16]. Recently the Japanese workers Y. Tamura and 
Y. Kanada have used Salamin’s algorithm to calculate x to 2” decimal places in 6.8 
hours. Subsequently 2 places were obtained ([18] and private communication). Even 
more remarkable is the fact that all the elementary functions can be calculated with 
similar dispatch. This was proved (and implemented) by Brent in 1976 [5]. These 
extraordinarily rapid algorithms rely on a body of material from the theory of elliptic 
functions, all of which was known to Gauss. It is an interesting synthesis of classical 
mathematics with contemporary computational concerns that has provided us with these 
methods. Brent’s analysis requires a number of results on elliptic functions that are no 
longer particularly familiar to most mathematicians. Newman in 1981 stripped this 
analysis to its bare essentials and derived related, though somewhat less computationally 
satisfactory, methods for computing a and log. This concise and attractive treatment may 
be found in {15]. 

Our intention is to provide a mathematically intermediate perspective and some bits 
of the history. We shall derive implementable (essentially) quadratic methods for 
computing x and all the elementary functions. The treatment is entirely self-contained 
and uses only a minimum of elliptic function theory. 


1. 3.141592653589793238462643383279502884197, The calculation of x to great 
accuracy has had a mathematical import that goes far beyond the dictates of utility. It 
requires a mere 39 digits of w in order to compute the circumference of a circle of radius 
2 x 10% meters (an upper bound on the distance travelled by a particle moving at the 
speed of light for 20 billion years, and as such an upper bound on the radius of the 
universe) with an error of less than J0~'* meters (a lower bound for the radius of a 
hydrogen atom). 

Such a calculation was in principle possible for Archimedes, who was the first person 
to develop methods capable of generating arbitrarily many digits of x, He considered 
circumscribed and inscribed regular n-gons in a circle of radius 1. Using n = 96 he 
obtained 


6336 14688 


2 : — 3.1428. 
3017.5 ~" <46735 > 1478 


3.1405--- 


If 1/A, denotes the area of an inscribed regular 2”-gon and 1/8, denotes the area of a 
circumscribed regular 2”-gon about a circle of radius | then 
Anat + B, 


(1.1) An as VA,B,, Bast = 5 


*Received by the editors February 8, 1983, and in revised form November 2!, 1983. This research was 
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This two-term iteration, starting with A,:='4 and B,:= 4, can obviously be used to 
calculate x. (See Edwards [9, p. 34].) Aj3, for example, is 3.14159266 which is correct 
through the first seven digits. In the early sixteen hundreds Ludolph von Ceulen actually 
computed x to 35 places by Archimedes’ method [2]. 

Observe that A, := 27" cosec (0/2”) and B, := 2-"~' cotan (0/2"*') satisfy the above 
recursion. So do A, := 2 " cosech (6/2”) and B,:= 2 *"' cotanh (6/2"*'). Since in both 
cases the common limit is 1/@, the iteration can be used to calculate the standard inverse 
trigonometric and inverse hyperbolic functions. (This is often called Borchardt’s algo- 
rithm [6], [19].) 

If we observe that 


1 
“30a 


we see that the error is decreased by a factor of approximately four with cach iteration. 
This is linear convergence. To compute nv decimal digits of 7, or for that matter arcsin, 
aresinh or log, requires O(m) iterations. 

We can, of course, compute a from arctan or arcsin using the Taylor expansion of 
these functions. John Machin (1680-1752) observed that 


Anyi Baas B,) 


1 1 
w = 16arctan (;) — 4arctan (=) 


and used this to compute z to 100 places. William Shanks in 1873 used the same formula 
for his celebrated 707 digit calculation. A similar formula was employed by Leonhard 
Euler (1707-1783): 


1 3 
x = 20 arctan (7) + 8 arctan (=). 
This, with the expansion 


y 2 DAs 
t =~|]1 nes —— eeue) ce 
arctan (x) | PSEA Geet 
where y = x*/(1 + x’), was used by Euler to compute x to 20 decimal places in an hour. 
(See Beckman [2] or Wrench [21] for a comprehensive discussion of these matters.) In 
1844 Johann Dase (1824-1861) computed a correctly to 200 places using the formula 


Fi arctan; arcta ( ret : 
is >) + arctan 5 + arctan 3}: 
Dase, an “idiot savant” and a calculating prodigy, performed this “stupendous task” in 


“just under two months.” (The quotes are from Beckman, pp. 105 and 107.) 
A similar identity: 


1 | 1 
aw = 24 arctan (5) + 8 arctan (=) + 4arctan (5) 


was employed, in 1962, to compute 100,000 decimals of +. A more reliable “idiot savant’’, 
the IBM 7090, performed this calculation in a mere 8 hrs. 43 mins. [17]. 

There are, of course, many series, products and continued fractions for x. However, 
all the usual ones, even cleverly evaluated, require O( vm) operations (+, x, +, ¥_) to 
arrive at x digits of . Most of them, in fact, employ O(7) operations for n digits, which is 
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essentially linear convergence. Here we consider only full precision operations. For a time 
complexity analysis and a discussion of time efficient algorithms based on binary splitting 
see [4]. 

The algorithm employed in [17] requires about 1,000,000 operations to compute 
1,000,000 digits of +. We shall present an algorithm that reduces this to about 200 
operations. The algorithm, like Salamin’s and Newman’s requires some very clementary 
elliptic function theory. The circle of ideas surrounding the algorithm for = also provides 
algorithms for all the elementary functions. 


2. Extraordinarily rapid algorithms for algebraic functions. We need the following 
two measures of speed of convergence of a sequence (a,) with limit L. If there is a constant 
C, so that 


last ~~ L| s Cla, _ Li? 


for all n, then we say that (a,) converges to £ quadratically, or with second order. If there 
is a constant C,> | so that, for all n, 


la, — L|= Cz” 


then we say that (a,) converges to L exponentially. These two notions are closely related; 
quadratic convergence implies exponential convergence and both types of convergence 
guarantee that a, and £ will “agree” through the first O(2") digits (provided we adopt the 
convention that .9999. . .9 and 1.000. . .0 agree through the required number of digits). 

Newton’s method is perhaps the best known second order iterative method. Newton’s 
method computes a zero of f(x) — y by 


LO) = y 
f'n) 
and hence, can be used to compute f ~' quadratically from f, at least locally. For our 


purposes, finding suitable starting values poses little difficulty. Division can be performed 
by inverting (1/x) — y. The following iteration computes 1/y: 


(2.1) Xap (i= X_ 


(2.2) Xn i= 2Xq_  XqV- 


Square root extraction (vy) is performed by 


l y 
(2.3) Xnal== (x. + 2) 

2 n 
This ancient iteration can be traced back at least as far as the Babylonians. From (2.2) 
and (2.3) we can deduce that division and square root extraction are of the same order of 
complexity as multiplication (see [5]}. Let Af(m) be the “amount of work” required to 
multiply two n digit numbers together and let D(n) and S(n) be, respectively, the 
“amount of work” required to invert an # digit number and compute its square root, to ” 
digit accuracy. Then 


D(n) = O(M(n)) 
and 
S(n) = O(M(n)). 


We are not bothering to specify precisely what we mean by work. We could for example 
count the number of single digit multiplications. The basic point is as follows. It requires 
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O(log 2) iterations of Newton’s method (2.2) to compute 1/y. However, at the ith 
iteration, one need only work with accuracy O(2'). In this sense, Newton’s method is 
self-correcting. Thus, 


logn 
D(n) = o(d uo) = O(M(n)) 
provided M (2') = 2M (2' '). The constants concealed beneath the order symbol are not 
even particularly large. Finally, using a fast multiplication, see [12], it is possible to 
multiply two digits numbers in O(n log (n) log log (#)) single digit operations. 

What we have indicated is that, for the purposes of asymptotics, it is reasonable to 
consider multiplication, division and root extraction as equally complicated and to 
consider each of these as only marginally more complicated than addition. Thus, when we 
refer to operations we shall be allowing addition, multiplication, division and root 
extraction. 

Algebraic functions, that is roots of polynomials whose coefficients are rational 
functions, can be approximated (calculated) exponentially using Newton’s method. By 
this we mean that the iterations converge exponentially and that each iterate is itself 
suitably calculable. (See [13],) 

The difficult trick is to find a method to exponentially approximate just one 
elementary transcendental function. It will then transpire that the other elementary 
functions can also be exponentially calculated from it by composition, inversion and so 
on. 

For this Newton’s method cannot suffice since, if fis algebraic in (2.1) then the limit 
is also algebraic. 

The only familiar iterative procedure that converges quadratically to a transcenden- 
tal function is the arithmetic-geometric mean iteration of Gauss and Legendre for 
computing complete elliptic integrals. This is where we now turn. We must emphasize 
that it is difficult to exaggerate Gauss’ mastery of this material and most of the next 
section is to be found in one form or another in [10]. 


3. The real AGM iteration. Let two positive numbers a and 6 with a > b be given. 
Let ap := a, by := b and define 


(3.1) Gay i= 


for ninN. 
One observes, as a consequence of the arithmetic-geometric mean inequality, that 
a, = a,,, = b,,,; = 6, for all x. It follows easily that (a,) and (b,) converge to a common 


at] = 


limit L which we sometimes denote by AG(a, 4). Let us now set 


(3.2) wim an — bp forne A. 
It is apparent that 

1 ce a 
3.3 m1 =~ (a, — 6,) = s—, 
(3.3) Cry 5) (a ) Ad, AL 


which shows that (c,) converges quadratically to zero. We also observe that 
(3.4) Ay = Ane + Crs) and Oy = Fat — Cay) 


which allows us to define a,, b, and c, for negative n. These negative terms can also be 
generated by the conjugate scale in which one starts with aj := a) and b4:= cy and defines 
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(a’,) and (47) by (3.1). A simple induction shows that for any integer n 
(3.5) Oy Egy Dy 2 Cla CO) Se BS 


Thus, backward iteration can be avoided simply by altering the starting values. For future 
use we define the quadratic conjugate k':= yi ~ k* for any k between O and 1. 

The limit of (a,) can be expressed in terms of a complete elliptic integral of the first 
kind, 


do 


3.6 Ha, b):= f°? ; 
a0) (a, 6) i a’ cos’ @ + b’ sin? @ 
In fact 

1 as 
3.7 T(a, b) == SSS 
een @)-> f° (+ Ae +P) 


as the substitution ¢:= @ tan @ shows. Now the substitution of u:= 4 (t — (ab/t)) and 
some careful but straightforward work [15] show that 


(3.8) I(a, b) = i((¢ 5 °|, sab). 


It follows that /(a,, 5,) is independent of 7 and that, on interchanging limit and integral, 


I(@, bo) = lim 1(@,, ,) = FL, £). 
Since the last integral is a simple arctan (or directly from (3.6)) we see that 
cs a 
(3.9) Tao, by) = 5 AG(ao, bo). 


Gauss, of course, had to derive rather than merely verify this remarkable formula. We 
note in passing that AG(-, -) is positively homogeneous. 

We are now ready to establish the underlying limit formula. 

PROPOSITION 1. 


(3.10) tim [log (?) =F(ly 2) =O) 


Proof. Let 
k’ sin 6 dé 


= fe I 
a ea qf Vk? + (kK’)" cos? 8 


x2 fl — k'sin® 
B(k) := ae ee, 
®) i 1 + k’sin @ 
Since | — (K’ sin 6)* = cos’ @ + (k sin 9)’ = (k’ cos 6)? + k’, we can check that 
I(1, &) = Afk) + Bk). 


Moreover, the substitution uz := k’ cos # allows one to evaluate 


1+) 


and 


vy au 
(3.11) A(k)= f Tape 
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Finally, a uniformity argument justifies 


xj2 cos 0 dé 


[ene eo 


(3.12) lim B(k) = B(0) = fe 
and (3.11) and (3.12) combine to show (3.10). 0 

It is possible to give various asymptotics in (3.10), by estimating the convergence rate 
in (3.12). 

PROPOSITION 2. For k € (0, 1] 


log (* i) - 10, k)| < 411, k) S$ 4K(8 + |log k1). 


(3.13) 


Proof. Let 


4 
A(k) := log (7) — 1, k). 
As in Proposition |, for k & (0, 1], 


(3.14) 
2 sad /2 1 — k’sin#@ {1 — siné 
A{(k)|s | lo G = lo as pene iarae care 
|AG)| e k 6 k 1 + k’sind 1 + sin 6 
We observe that,since] &k’=1—- yl — kk, 


voli) vol) - 9) 


Also, by the mean value theorem, for cach # there is a y € [0, k], so that 
1~ k’ sind | ji — sin é 
1 + k'sin@ 1 + sin@ 

1-(1_ k)sino Se cee 
“IN ia a eyaine 1 + sind 


wl 


(3.15) 1k sR. 


it 


s 


_fyi+ Gv) sing 2y sin 8 i 
ff —( — ¥) sind “a+ 7)sinody 
2yk 2k? 


~y¥l-—¥)sind yI-G—K)sing 
This yields 


* — k'si — si - dO 
yo Le es do 5240 0? accrree s 22 HU(I, K) 
which combines with (3.14) and (3.15) to show that 
|A(K)} s (1 + 2¥2)K (1, &) S 4770, k). 
We finish by observing that 


ce 
KIC, k) 35 
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allows us to deduce that 
4 
IU, k) S 2rk + log mae O 
Similar considerations allow one to deduce that 
(3.16) |A(k) — A(A)| = 2r|k — A| 


forO<k,h <1/ y2. 

The next proposition gives all the information necessary for computing the elemen- 
tary functions from the AGM. 

PROPOSITION 3. The AGM satisfies the following identity (for all initial values): 


; 4 
(3.17) im 2-* top { =) 5. 
tore a, A 2 
Proof. One verifies that 
T , ¢ , 
37 lim 4,7 (ap, bo) (by (3.9)) 


= lim @,/ (a, 6 ,) (by (3.8)) 


ae 


= lim @,/(2"a,, 2"¢,) (by 3.5)). 


Now the homogeneity properties of /({-, -) show that 


2? 
(2, 2°c,) = — (1.2), 


ft 


Thus 
7 lim + 2(1, 2), 
2 oe a, a, 
and the result follows from Proposition 1. Oo 


From now on we fix @ := aj:= 1 and consider the iteration as a function of by := k 
and cq := k’. Let P, and Q, be defined by 


4a, glen 4, 
(3.18) pe) (M2), Quy = 2, 
Ch a, 
and let P(x) := lim,_. P,(k), Q(k) «= lim, ... @,(k). Similarly let a:= a(k) := lim, a, 
and a’ := a'(k) := lim, ..,, a’, 
THEOREM 1. For 0 < k < 1 one has: 


(a) P(k) = exp (wO(k)), 


16 f{a,-—a@ 
(3.19) (b) Os P(k) — P(k) 35 (==), 


la—a,|+ala’ a, 


(a’)’ 


Proof. (a) is an immediate rephrasing of Proposition 3, while (c) is straightforward. 


() 10k) - a} <2 


88 


4. THE AGM AND FAST COMPUTATION OF ELEMENTARY FUNCTIONS (1984) 


358 J. M. BORWEIN AND P. B. BORWEIN 


To see (b) we observe that 


a\e 
(3.20) P= P,- (“) 
because 4a,,;C,,, = C2, as in (3.3). Since a,,, = a, we see that 
at a, 
OS P,— Pra S f a (=) |. < (1 as 1) p, 
Qn an 
(3.21) 


a, —a 
P, Pris 5 (2) p, 


since a, decreases to a. The result now follows from (3.21) on summation. O 
Thus, the theorem shows that both P and Q can be computed exponentially since (a,) 
can be so calculated. In the following sections we will use this theorem to give 
implementable exponential] algorithms for 7 and then for all the elementary functions. 
We conclude this section by rephrasing (3.19a). By using (3.20) repeatedly we derive 
that 


16 + eg 
(3.22) P=: pill pale 


Let us note that 


ay 2a, 2 


and x,:= b,/a, satisfies the one-term recursion used by Legendre [14] 


em n b, l b, 
Get An + (r+ ) 


2X, 
(3.23) Xi = = Xyi= k. 
Thus, also 
16 Tf ( Ba ) (: a8 
24 P aerate DD gt SD. 
(3.24) m@®) ~ ells a 


When k:= 2 '?, k — k’ and one can explicitly deduce that P(2~"/”) = e". When k = 2 '? 
(3.22) is also given in [16]. 


4. Some interrelationships. A centerpiece of this exposition is the formula (3.17) of 
Proposition 3. 


1 4a, a, 
(4.1) lim tog ( ' =~ lim, 
: Cp 2 re My 


coupled with the observation that both sides converge exponentially. To approximate log x 
exponentially, for example, we first find a starting value for which 


4a,\'"/* 
( ane 
c 


n 


This we can do to any required accuracy quadratically by Newton’s method. Then we 
compute the right limit, also quadratically, by the AGM iteration. We can compute exp 
analogously and since, as we will show, (4.1) holds for complex initial values we can also 
get the trigonometric functions. 
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There are details, of course, some of which we will discuss later. An obvious detail is 
that we require 7 to desired accuracy. The next section will provide an exponentially 
converging algorithm for a also based only on (4.1). The principle for it is very simple. If 
we differentiate both sides of (4.1) we lose the logarithm but keep the ! 

Formula (3.10), of Proposition 1, is of some interest. It appears in King [11, pp. 13, 
38] often without the “4” in the log term. For our purposes the “4” is crucial since without 
it (4.1) will only converge linearly (like (log 4)/2”). King’s 1924 monograph contains a 
wealth of material on the various itcrative methods related to computing elliptic integrals. 
He comments [11, p. 14]: 

“The limit [(4.1) without the “4”] does not appear to be generally known, although 
an equivalent formula is given by Legendre (Fonctions éliptiques, t. 1, pp. 94-101).” 

King adds that while Gauss did not explicitly state (4.1) he derived a closely related 
series expansion and that none of this “appears to have been noticed by Jacobi or by 
subsequent writers on elliptic functions.” This series [10, p. 377] gives (4.1) almost 
directly. 

Proposition 1 may be found in Bowman [3]. Of course, almost all the basic work is to 
be found in the works of Abel, Gauss and Legendre [1], [10] and [14]. (See also [7].) As 
was noted by both Brent and Salamin, Proposition 2 can be used to estimate log given z. 
We know from (3.13) that, forO< k = 10 3, 


ioe (¢) - 10, | < 10k?| log k|. 


By subtraction, for0 <x < 1l,andn = 3, 
(4.2) llog (x) — 71, 107") — (1, 10-"x)] | < n 1072"- 


and we can compute log exponentially from the AGM approximations of the elliptic 
integrals in the above formula. This is in the spirit of Newman’s presentation [15]. 
Formula (4.2) works rather well numerically but has the minor computational drawback 
that it requires computing the AGM for smail initial values. This leads to some linear 
steps (roughly log ()) before quadratic convergence takes over. 

We can use (3.16) or (4.2) to show directly that 7 is exponentially computable. With 
k:=10-"and h:= 107" + 107-” (3.16) yields with (3.9) that, for n= 1, 


T 1 1 
0" #14 < 10'-™, 
log (10™" + 1) - 31 4Gq, 105 ~ AGG 10" + || 
Since] log (x + 1)/x — 1] s x/2 for 0 <x < 1, we derive that 
$ 10" 10” le 
oe 7 Era 10-7) AG(I, 10-7 + 10°] | ee 


Newman [15] gives (4.3) with a rougher order estimate and without proof. This 
analytically beautiful formula has the serious computational drawback that obtaining n 
digit accuracy for 7 demands that certain of the operations be done to twice that 
precision. 

Both Brent’s and Salamin’s approaches require Legendre’s relation: forO<k <1 


(4.4) 70, J, kK) + 10, KI, &) — 10, kM, k) -5 


where J(a, 5) is the complete elliptic integral of the second kind defined by 


J (a, 6) := {” y@ cos’ § + b’ sin” 6 dé. 
if] 
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The elliptic integrals of the first and second kind are related by 


(4.5) Hao bs) = (a — 5° 2%e2) 1a, b) 
n=O 

where, as before, c? = a2 — b2 and a, and 5, are computed from the AGM iteration. 

Legendre’s proof of (4.4) can be found in [3] and [8]. His elegant elementary 
argument is to differentiate (4.4) and show the derivative to be constant. He then 
evaluates the constant, essentially by Proposition 1. Strangely enough, Legendre had 
some difficulty in evaluating the constant since he had-problems in showing that k? log (x) 
tends to zero with k [8, p. 150]. 

Relation (4.5) uses properties of the ascending Landen transformation and is derived 
by King in [11]. 

From (4.4) and (4.5), noting that if k equals 2~'” then so does &’, it is immediate 
that 


7 2AGU,2 7 


4.6 = 
— ee Se aa 


This concise and surprising exponentially converging formula for + is used by both 
Salamin and Brent. As Salamin points out, by 1819 Gauss was in possession of the AGM 
iteration for computing elliptic integrals of the first kind and also formula (4.5) for 
computing elliptic integrals of second kind. Legendre had derived his relation (4.4) by 
1811, and as Watson puts it [20, p. 14] ‘tin the hands of Legendre, the transformation 
[(3.23)] became a most powerful! method for computing elliptic integrals.” (See also [10], 
[14] and the footnotes of [11].) King [11, p. 39] derives (4.6) which he attributes, in an 
equivalent form, to Gauss. It is perhaps surprising that (4.6) was not suggested as a 
practical means of calculating x to great accuracy until recently. 

It is worth emphasizing the extraordinary similarity between (1.1) which leads to 
linearly convergent algorithms for all the elementary functions, and (3.1) which leads to 
exponentially convergent algorithms. 

Brent’s algorithms for the elementary functions require a discussion of incomplete 
elliptic integrals and the Landen transform, matters we will not pursue except to mention 
that some of the contributions of Landen and Fagnano are entertainingly laid out in an 
article by G.N. Watson entitled “The Marquis [Fagnano] and the Land Agent [Land- 
en]” [20]. We note that Proposition 1 is also central to Brent’s development though he 
derives it somewhat tangentially. He also derives Theorem 1 (a) in different variables via 
the Landen transform. 


5. An algorithm for x. We now present the details of our exponentially converging 
algorithm for calculating the digits of +. Twenty iterations will provide over two million 
digits. Each iteration requires about ten operations. The algorithm is very stable with all 
the operations being performed on numbers between '/ and 7. The eighth iteration, for 
example, gives mw correctly to 694 digits. 

THEOREM 2. Consider the three-term iteration with initial values 

a= V2, By 0, 1-24 V2 
given by 
i 
2 


(i) oe i (a/? an a, '!), 
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2 yf Batt 
11 = a ———}, 
( ) Ba, 1 " (é 4 Bp 
2 L + 41 
lll) @, =f, hs. 
( ) nil Buri (5 + Brat 


Then z,, converges exponentially to and 


l 
la, —a7|S—. 


10? 


Proof. Consider the formula 


1 a wa 
5.1 ] 4— = 
eh) 2" oe ‘| 2 a, 
which, as we will see later, converges exponentially at a uniform rate to zero in some 
(complex) neighbourhood of 1/ V2. (We are considering each of a,, By, Cys Gps Bi, Cy aS 
being functions of a complex initial value k,i.c. 4) =k, bp= V¥1 9k, a) = ay = 1.) 
Differentiating (5.1) with respect to & yields 


l fa, ¢ wa, {Ay a, 
5.2 Soke Se ee eae Se 
ee Hla al oale a) 
which also converges uniformly exponentially to zero in some neighbourhood of 1/ 72. 
(This general principle for exponential convergence of differentiated sequences of 


analytic functions is a trivial consequence of the Cauchy integral formula.) We can 
compute a,, b, and c, from the recursions 


ats 
Gay t= 
3 ae b,4/@ 
2 cota 
buat 5 (be — by)s 


where a := 0, 09 := 1, ay := Land by:= k. 

We note that a, and 6, map {z|Re(z) > 0} into itself and that 4, and 6, (for 
sufficiently large 7) do likewise. 

It is convenient to set 


(5.4) a, = and B, oa z 


with 


We can derive the following formulae in a completely elementary fashion from the basic 
relationships for a,, 6, and c, and (5.3): 


68) aban hom - Vi ( 2), 
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(5 6) 1 Ans} Cnsl 2(a, a B,) 


Bans Ons 7 {a, ~ 1, + 1) , 


1 
(5.7) Oni = 5 (on? + a"), 


Br +1 
B.., =ai/2{ 2 
(5.8) a+] n (e *). 


i 

(5.9) Oy, — 1 = Jai (ow — 17, 
ay? (i an an)(B, _ a,) 

2 a(B, + a)’ 
(5.11) Ans — Bast _a + a?) r (on = Bn) 

An+1 1 (B, + ay) (a, 7s 1) 

From (5.7) and (5.9) we deduce that «, > 1 uniformly with second order in compact 
subsets of the open right half-plane. Likewise, we see from (5.8) and (5.10) that 8, —> 1 
uniformly and exponentially. Finally, we set 


(5.10) Bnet — Bast ions 


1 Oy — B, 
12 paren fi Se) 
(5.12) w= (29) 
We see from (5.11) that 
(1 + e/) 
(5.13) Yn 908 o,) Ye 
and also from (5.6) that 
Yn 1 ( Qnst Cas ' 
5.14 = fie 
( ) 1 + Bn ane Anal Cnt 


Without any knowledge of the convergence of (5.1) one can, from the preceding 
relationships, easily and directly deduce the exponential convergence of (5.2), in 
{z||z — |S ¢ < %}. We need the information from (5.1) only to see that (5.2) converges 
to zero. 

The algorithm for « comes from multiplying (5.2) by @,/a, and starting the iteration 


at k:= 27'/*, For this value of k a’, = a,, (a’,) = —d, and 
2 (1 " Gest fast) v 
gett Gast Catt 
which by (5.14) shows that 
Ps ae 
"1448, 


Some manipulation of (5.7), (5.8) and (5.13) now produces (iii). The starting values for 
a,, 8, and y, are computed from (5.4). Other values of k will also lead to similar, but 
slightly more complicated, iterations for z. 
To analyse the error one considers 
Yow! i (i + a,?) 1 


1+Bui 1+8, [28,+to)0 +B) U+8)| 
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and notes that, from (5.9) and (5.10), for n = 4, 


1 1 
la, — 113 oe and 18, — 11S sora: 


(Onc computes that the above holds for 7 = 4.) Hence, 


Yost Yn zy eore 
14+ By: 1+ 8) [toe 
and 
Ya = ep, 
rea “1° 107" 


In fact one can show that the error is of order 2"e~*”"". 

If we choose integers in [6,57'], 0<6<% and perform » operations 
(+, —, X, ~, Vv) then the result is always less than or equal to 6”. Thus, if y > 4, it is not 
possible, using the above operations and integral starting values in [6, 5° '], for every # to 
compute m with an accuracy of O(y~”) in steps. In particular, convergence very much 
faster than that provided by Theorem 2 is not possible. 

The analysis in this section allows one to derive the Gauss-Salamin formula (4.6) 
without using Legendre’s formula or second integrals. This can be done by combining our 
results with problems 15 and 18 in [11]. Indeed, the results of this section make 
quantitative sense of problems 16 and 17 in [il]. King also observes that Legendre’s 
formula is actually equivalent to the Gauss—Salamin formula and that each may be 
derived from the other using only properties of the AGM which we have developed and 
equation (4,5). 

This algorithm, like the algorithms of §4, is not self correcting in the way that 
Newton’s method is. Thus, while a certain amount of time may be saved by observing that 
some of the calculations need not be performed to full precision it seems intrinsic (though 
not proven) that O(log n) full precision operations must be executed to calculate x to n 
digits. In fact, showing that x is intrinsically more complicated from a time complexity 
point of view than multiplication would prove that w is transcendental [5]. 


6. The complex AGM iteration. The AGM iteration 
1 
Qny1 = 2 (4, + ba)s But = VGnPn 


is well defined as a complex iteration starting with a) := 1, by :— z. Provided that z does not 
lie on the negative real axis, the iteration will converge (to what then must be an analytic 
limit). One can sec this gcomctrically. For initial z in the right half-plane the limit is given 
by (3.9). It is also easy to see geometrically that a, and 5, are always nonzero. 

The iteration for x, := 6,/a, given in the form (3.23) as X,4) = 2V%,/X,4 satisfies 

(1 — vx)? 

(6.1) Gai - D= i 
This also converges in the cut plane C — (—~, O]. In fact, the convergence is uniformly 
exponential on compact subsets (see Fig. 1). With each iteration the angle @, between x, 
and 1 is at least halved and the real parts converge uniformly to 1. 

It is now apparent from (6.1) and (3.24) that 


« reo (C(t 


nt | —x, 
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and also, 


O,(k) = “* 


‘ 
a 


converge exponentially to analytic limits on compact subsets of the complex plane that 
avoid 


D:={zEC|z¢ (—~, O] v [1, ~)}. 


Again we denote the limits by P and Q. By standard analytic reasoning it must be that 
(3.19a) still holds for k in D. 

Thus one can compute the complex cxponential---and so also cos and sin— 
exponentially using (3.19). More precisely, one uses Newton’s method to approximately 
solve O(k) =z for k and then computes P,(k). The outcome is e”. One can still perform the 
root extractions using Newton’s method. Some care must be taken to extract the correct 
root and to determine an appropriate starting value for the Newton inversion. For 
example k := 0.02876158 yields O(k) = 1 and P,(k) = e to 8 significant places. If one now 
uses k as an initial estimate for the Newton inversions one can compute e'*” for|0|< 7/8. 
Since, as we have observed, e is also exponentially computable we have produced a 
sufficient range of values to painlessly compute cos @ + isin @ with no recourse to any 
auxiliary computations (other than a and e, which can be computed once and stored). By 
contrast Brent’s trigonometric algorithm needs to compute a different logarithm each 
time. 

The most stable way to compute P, is to use the fact that one may update c,, by 


(6.3) Casi = 


One then computes a,, b, and c, to desired accuracy and returns 
(=y" (2s ue =" 
—— or {|————] . 
C, Cn 


This provides a feasible computation of P,, and so of exp or log. 
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In an entirely analogous fashion, formula (4.2) for log is valid in the cut complex 
plane. The given error estimate fails but the convergence is still exponential. Thus (4.2) 
may also be used to compute all the elementary functions. 


7. Concluding remarks and numerical data. We have presented a development of the 
AGM and its uses for rapidly computing elementary functions which is, we hope, almost 
entirely self-contained and which produces workable algorithms. The algorithm for z is 
particularly robust and attractive. We hope that we have given something of the flavour of 
this beautiful collection of ideas, with its surprising mixture of the classical and the 
modern. An open question remains. Can one entirely divorce the central discussion from 
elliptic integral concerns? That is, can one derive exponential iterations for the elemen- 
tary functions without recourse to some nonelementary transcendental functions? It 
would be particularly nice to produce a direct iteration for e of the sort we have for r 
which docs not rely either on Newton inversions or on binary splitting. 

The algorithm for w has been run in an arbitrary precision integer arithmetic. (The 
algorithm can be easily scaled to be integral.) The errors were as follows: 


Iterate Digits correct Iterate Digits correct 


Formula (4.2) was then used to compute 2 log (2) and log (4), using a estimated as 
above and the same integer package. Up to 500 digits were computed this way. It is worth 
noting that the error estimate in (4.2) is of the right order. 

The iteration implicit in (3.22) was used to compute e” in a double precision Fortran. 
Beginning with k:= 2~'” produced the following data: 


Iterate Py 8k afb, 1 


1.6 x 107' 1.5 x 10°? 
2.8 x 10° 2.8 x 10° 


1.7 x 10-7 9.7 «x Loc} 
< 10°” 12x10? 


Identical results were obtained from (6.3). In this case y, := 4a,/c, was computed by the 
two term recursion which uses x,, given by (3.23), and 


16 1+ x,\" 
(7.1) Yop ers =( 5 - 


One observes from (7.1) that the calculation of y, is very stable. 
We conclude by observing that the high precision root extraction required in the 
AGM [18], was actually calculated by inverting y = 1/x’. This leads to the iteration 


3x, — x3y 


(7.2) Xngi = 2 


for computing y ~'/?. One now multiplies by y to recapture Vy. This was preferred because 
it avoided division. 
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5. A simplified version of the fast algorithms of 
Brent and Salamin (1985) 


Paper 5: D. J. Newman, “A simplified version of the fast algorithms of Brent and 
Salamin,” Mathematics of Computation, vol. 44 (1985), p. 207-210. Reprinted by 
permission of the American Mathematical Society. 


Synopsis: 
In this paper, Newman defines an alternate version of the arithmetic-geometric 
mean, which he denotes h(a, b), defined as follows: 


An+1 = Anbn, bn41 = 2anbn/ (Gn + bn). 


With this formulation, he is able to derive the formulas of Brent and Salamin more 
easily, avoiding the need for elliptic integral functions and Landen transforms, and 
also derives some new formulas. He shows, for instance, that N(h(N + 1,1) — 
h(N,1)) gives 2/m to n-bit accuracy, provided we choose N = 2”, and 


"(Tey ) 


gives 1/log to n-bit accuracy, provided we choose N = «”/4. 


Keywords: Algorithms, Arithmetic-Geometric Mean 
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MATHEMATICS OF COMPUTATION 
VOLUME 44, NUMBER 169 
JANUARY 1985, PAGES 207-210 


A Simplified Version of the Fast Algorithms 
of Brent and Salamin 


By D. J. Newman* 


Abstract. We produce more elementary algorithms than those of Brent and Salamin for, 
respectively, evaluating e* and 7. Although the Gauss arithmetic-geometric process still plays 
a central role, the elliptic function theory is now unnecessary. 


In their remarkable papers, Brent [1] and Salamin [3], respectively, used the theory 
of elliptic functions to obtain “fast” computations of the function e* and of the 
number 7. In both cases rather heavy use of elliptic function theory, such as the 
transformation law of Landen, had to be utilized. Our purpose, in this note, is to 
give a highly simplified version of their constructions. In our approach, for example, 
the incomplete elliptic integral is never used. 

We begin as they did with the Gauss arithmetic-geometric process, T(a, b) = 
((a + b)/2, Vab ) which maps couples with a > b > 0 into same. From the inequali- 


ties 
(a+b)/2-Vab _ {Va - vb a) 
(a+b)/2+Vab \Vva+yo}] ©\a+b}’ 
and 


a 
oo = =: 
vab vab Vb 


we see that T'(a, b) goes to its limiting couple (m, m) (m = m(a, b) the so-called 
arithmetic-geometric mean) with “quadratic” speed. Indeed, m(a, b) is determined 
to n places for an i of around logloga/b + logn. The loglog from the ya/b 
inequality expressing the time till the ratio first goes below 2, and the log from the 
((a — b)/(a + b))? inequality expressing the time for the error squaring to do its 
job. 

Next, we recall Gauss’ beautiful formula: 


oo dx 
m9) ere 
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which follows from the fact that this (complete) elliptic integral is invariant under T. 
This fact, that namely 


ia dx = a dt 
0 f(x? 4 a?)(x? +B) (2 +((a + b)/2)\(P? + ab) 
is a simple consequence of the change of variables t = (x — ab/x)/2. Namely, we 
obtain 
2 2 2 2 2 2 
i= ey +(24?) ait te) 
5° 2 4x? 
(x? + ab)? 


t+ ab= 5 
4x 


5 


0 < x < oo, so that indeed we have 


a ax -{* 2dax 
-o (x? + a®)(x2 +b?) *0 ea 


.s 2 y{ 7? +( ai )(2 + ab) 


Accordingly, a repeated use of this invariance gives 


20 dx = 
Ne Tee y fe eas 


and this is exactly Gauss’ formula. 

Actually, it is handier for us to work with what we might call the harmonic-geo- 
metric mean which can be defined by h(a, b) = ab/m(a, b) or, alternatively, as the 
limit under repeated applications of S, rather than 7, where 

S(a, b) = (Vab ,2ab/(a + b)). 
In these terms Gauss’ formula reads 
1 oo 
h(a, b) = — | 


~oc J(1 + x?/a*)(1 + x?/b7) 


The only place that we actually use this formula is to establish the asymptotic 
formula: 


h(N,1) = = jog AN + O(1/N’). 


(This simple-looking formula certainly deserves an elementary proof independent of 
elliptic integrals, but we are unabie to find one.) 
So begin with 
|. dx 
(G+ x?) + x?/N? ) 


A(N,1) = 
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and observe that the map x — N/x leaves the integrand invariant. Thereby, we 
conclude 
YN ax = i 


0 y(1 + x?)(1 + x?/N?) ) Ww 


which gives us 


A(N,1) = peau 
70 V(1 + x?)(1 + x?/N?) 


(1+ x*)(14+ x2/N 


— ew) (1 - x2/2N? + O(x4/N*)) dx 


0 {Q +x?) 

4 p{_ 1 al 

=— Ix + O(1/N 
0 v1 + x2 Nn? (/ )} 


‘s Tea YN +1)-1/4N) + O(1/N?) 
and so, since 
VN + YN +1 = 2N(1 + 1/(2N + 2/M(N +1), 


we obtain 
log(VN + YN + 1) = log2¥N + 1/4N + o(-5]. 


which together with the previous gives 
4 1 2 1 
h(N,1) = ~log2vN + of 55] = =log4N + of 5]. 


as required. (This result can also be found in [2].) 

Summarizing, then, we have produced a fast method for obtaining n places of 
2 log4N/a (if N is of the size c”). But, and here is the trick, this combination of 7 
and the logarithm can be used to yield both of them separately, and we can thereby 
rederive both Salamin’s and Brent’s results. 

To obtain 7 we examine the difference, A(N + 1,1) — h(N,1), and observe that 
N(A(N + 1,1) — ACN,1)) = 2/2 + O(1/N) which gives n place accuracy for 7 if 
we choose, e.g., N = 2”, 

For the logarithm, on the other hand, we look to the quotient, h(N + 1,1)/h(N, 1). 
This time we obtain 

| A(N +1,1) | | _ yioa(l + 1/N) + O(1/N?) _ 1 

h(N,1) log4N + O(1/N) log4N + O(1/N) 
From this we will be able to evaluate log x throughout the interval (3,9), and so, of 
course, throughout any interval. And thereby, we will be able to obtain e*, the 
inverse function, by the usual use of the (fast) Newton iteration scheme. 

To obtain log x, then, in the interval (3,9), we first calculate N = $x”, a process 
that takes only logn multiplications. But then the above formula becomes, upon 
substitution of this value of N, 


lan? + 1,1 
bnxn| MG" # ED 9) 21 of 2) = 1 + of) 
a n(x") loge " Vx") ~ Toga 93" 


which does give the desired n place evaluation of log x. 
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This trick of “differencing” h(N + 1,1) and h(N,1), of course, carries a price. 
Thus we must compute these two quantities to 2n places and so the running time is 
around twice as long as the corresponding ones of Brent and Salamin. 
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6. Is pi normal? (1985) 


Paper 6: Stan Wagon, “Is pi normal,” The Mathematical Intelligencer, vol. 7 
(1985), p. 65-67. With permission of Springer. 


Synopsis: 

As our earlier Compendium makes clear, mathematicians have been fascinated 
by the decimal expansion (and expansions in other bases) of 7 since the time of 
Archimedes — what sort of number is 7? Questions such as whether 7 is rational 
or not, or algebraic or not, were settled in the 18th and 19th century, respectively. 

But one question, originally raised by Borel in 1909, remains unanswered even 
today: whether or not 7 is normal. A real constant is said to be normal base 10, 
or 10-normal, if every m-long string of digits appears in the decimal expansion of 7 
with limiting frequency 1/10” (with a similar definition for a general base b), and is 
said to be absolutely normal if it is b-normal for all integer bases b simultaneously. 

In this highly readable paper, Wagon introduces the question of the normality 
of 7 in the context of the recently discovered quadratically convergent algorithms. 
He also presents a statistical analysis of the digits of 7 provided by Yasamusa 
Kanada, who had, at the time, just computed a to 10,000,000-digit precision. 
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The Evidence 


Is 7 Normal? 
by Stan Wagon 


The nature of the number 7 has intrigued mathema- 
ticians since the beginning of mathematical history. 
The most important properties of 7 are its irrationality 
and transcendence, which were established in 1761 
and 1882, respectively. In the twentieth century the 
focus has been on a different sort of question, namely 
whether tw, despite being irrational and transcen- 
dental, is normal. 

The idea of normality, first introduced by E. Borel 
in 1909, is an attempt to formalize the notion of a real 
number being random. The definition is as follows: A 
real number x is normal in base 6 if in its representation 
in base b all digits occur, in an asymptotic sense, 
equally often. In addition, for each m, the b™ different 
m-strings must occur equally often. In other words, 
lim,,.~ N(s,4)/n = 6-™ for each m-string s, where N(s,n) 
is the number of occurrences of s in the first x base-b 
digits of x. A number that is normal in all bases is 
called normal. The apparent randomness of 1's digits 
had been observed prior to the precise definition of 
normality. De Morgan, for example, pointed out that 
one would expect the digits to occur equally often, but 
yet the number of 7’s in the first 608 digits is 44, much 
lower than expected. However, it turned out that his 
count was based on inaccurate data. 

There are lots of norma] numbers—Borel proved 
(see Niven or §9,12 of Hardy and Wright) that the set 
of non-normal numbers has measure zero—but it is 
difficult to provide concrete examples. While an un- 
dergraduate at Cambridge University, D. Champer- 
nowne proved that 0.12345678910111213 ... is normal 
in base 10, but an explicit example of a normal nirmber 
is still lacking. 

The question of 7’s normality only scratches the sur- 


Column Editor: Stan Wagon, Department of Mathematics, Smith 
College, Northampton, Massachusetts 01063, USA. 


face of the deeper question whether the digits of m are 
“random.” That normality is not sufficient follows 
from the observation that a truly random sequence of 
digits ought to be normal when only digits in positions 
corresponding to perfect squares are examined. But if 
all such positions in a normal number are set to 0, the 
number is still normal. On the other hand, more rig- 
orous definitions of “random” exclude 7 because w‘s 
decimal expansion is a recursive sequence. For an en- 
lightening discussion of the general problem of de- 
fining randomness, see the section on ‘What is a 
Random Sequence” in volume 2 of Knuth’s trilogy. 

Thus deeper questions are lurking, but so little is 
known about 7’s decimal expansion that it is reason- 
able to focus on whether 7 is normal to base ten. To 
put our ignorance in perspective, note that it is not 
even known that ail digits appear infinitely often: per- 
haps 


aw = 3.1415926.....01001000100001000001... . 


In order to gather evidence for m’s normality one 
would like to examine as many digits as possible. 
Those who have pursued the remote digits of 7 have 
often been pejoratively referred to as “digit hunters,” 
but certain recent developments have added some 
glamor to the centuries-old hunt. In 1975 Brent and 
Salamin, independently, discovered an algorithm that 
dramatically lowered the time needed to compute 
large numbers of digits of 7. Moreover, the algorithm 
has important connections with efficient algorithms for 
computing various transcendental functions (sin, 
arctan, exp, log, elliptic integrals) to great accuracy. 

The reader is probably aware that various arctangent 
formulas have been central in the computation of 7. 
Early investigators used series such as the one for 
arctan 1/V3, but convergence was much speeded by 
the use of formulas such as the following, by means 
of which Machin computed 100 decimals of m in 1706: 
a = l6arctant/5 - 4arctan1/239. Indeed, this same 
formula was used in the first computer calculation, the 
ENIAC’s computation of 2037 digits in 1949. Com- 
puters have become much faster, and various opti- 
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Year Time Number of digits Computer time per digit 
1949 ~70 hours 2,037 2 minutes 
1958 100 minutes 10,000 0.6 second 
1961 8.43 hours 100,000 1/3 second 
1973 23.3 hours 1,000,000 1/12 second 
1983 <30 hours 16,000,000 <1/155 second 
| Sophisticated algorithms, combined with ever-faster machines, have led to increas- 
ingly more efficient computations of 1's digits during the computer age. 
Table 1 


mizing tricks help speed up computations, but even 
the million-digit computation in 1973 used the same 
sort of formula, this one due to Gauss: 


a = 48arctanl/18 + 32arctanl1/57 + 20arctan1/239. 


A limiting aspect of the arctangent formulas is the 
number of full-precision operations that must be car- 
ried out. By an operation we mean one of +, X, +, 
V. Since large-precision square roots and divisions 
can be performed in essentially as much time as that 
required by a full-precision multiplication (Newton's 
method can be used for both; see Borwein and Bor- 
wein), this measure of complexity (number of full-pre- 
cision operations) is as good as the more usual “time 
complexity’ for comparing algorithms. Now, an ex- 
amination of the rate of convergence of the arctangent 
series shows that the arctangent method uses O(n) 
(i.e., at most cn, c a constant) full-precision operations 
to compute n decimals of 7; for example, the Shanks 
and Wrench computation of 100,000 decimals used just 
under 105,000 full-precision operations. Thus there are 
two basic time costs involved in the pushing of a cal- 
culation from n to 10n digits: 


(1) the number of operations increases by a factor of 
10, and 

(2) the time for each full-precision operation is about 
10 times greater. 


The Brent-Salamin algorithm requires only O (log 7) 
full-precision operations for n digits of 7. Since log n 
barely increases when n is replaced by 10n, the first of 
the two costs just mentioned is almost entirely elimi- 
nated! 

The Brent-Salamin formula uses ideas that go back 
to Gauss and Legendre, but prior to the 1970s no one 
had thought to apply these ideas to evaluate 7. The 
formula exploits the speed of convergence of the de- 
fining sequences for the arithmetic-geometric mean of 
two numbers. Given positive reals a) > bp, their arith- 
metic-geometric mean, AG(ap,bp), is defined to be the 
common limit of {@,}, {b,}, where a, = (4,1 + b,_1)/2 
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and b, = Va,-1 6,1. Gauss had investigated such 
limits and had proved that AG(a,b) equals 1/21, where 
at 


m2 
2 oo 
) Varcos*t + b’sin’t 


a complete elliptic integral of the first kind. To get a 
formula for 7 let ag, by be 1, 1/2, respectively, and 
define d,, to be a2 — b?; then 


T= wacaavayl( - 3 2 . 
jel 


If 7, is defined to be 4a,,(1 — 27_, 2*1d), then 
T, converges quadratically to 7. This means, roughly, 
that the number of correct digits doubles from one 7, 
to the next. 

Quadratic convergence is most familiar from New- 
ton’s method of approximating solutions to algebraic 
equations. Thus, from a computational perspective, 7 
behaves like an algebraic number. 

A more precise error analysis shows that 1. is ac- 
curate to 178,000 digits, 7,, to over a million, 7, to 
over ten million, and 1, to almost 200,000,000 digits. 
Since the computation of 7, requires 7n full-precision 
operations, the improvement over the classical algo- 
rithm is impressive: 100,000 digits require only 112 full- 
precision operations! 

The fact that c log n full-precision operations yield x 
digits of + means that the time complexity (essentially, 
number of bit operations) of the computation is 
O (n log’n loglog 1); this uses the fact that n-digit mul- 
tiplication is of complexity O(n log n loglog 1). Thus 
these fast algorithms for m are just about the fastest 
possible: it takes n steps just to write down n digits, 
and there is not much room between n and 1 log’n 
loglog x. 

For refinements of the Brent-Salamin formulas, ap- 
plications to the computation of transcendental func- 
tions, and some proofs, see the paper by Borwein and 
Borwein, who are preparing a book on the arithmetic- 
geometric mean. See the paper by Cox for more on the 


Type of hand 
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Expected number 


Actual number 


No two digits the same 604,800 604,976 
One pair 1,008,000 1,007,151 
Two pair 216,000 216,520 
Three of a kind 144,000 144,375 
Full house 18,000 17,891 
Four of a kind 9,000 8,887 

| Five of a kind 200 200 
Distribution of the first two million poker hands in the digits of 7. 

Table 2 


arithmetic-geometric mean and for a historical account 
of Gauss’s work. 

There is a surprising connection between these 
modern algorithms and the method of Archimedes. 
Archimedes used inscribed and circumscribed poly- 
gons to approximate 7. Now, if A, is the reciprocal of 
the circumference of a 2"-gon inscribed in a unit circle 
and B, likewise for a circumscribed 2"-gon, then A, 
and B, satisfy: B,,; = 4A, + B,) and A,,, = 
VA,Bn+1- Thus the double sequence of Archimedes 
obeys a recursion almost identical to the defining re- 
cursion for AG(1/2, 1/4). Archimedes’ sequences con- 
verge much more slowly, however: each iteration de- 
creases the error by, approximately, a factor of four. 

The Brent-Salamin algorithm has been implemented 
in Japan by Kanada, Tamura, Yoshino, and Ushiro 
who, in 1983, used it to compute 16 million decimal 
places. They checked the first 10,013,395 of these using 
Gauss’s arctangent relation. The computation of 16 
million digits took less than 30 hours of CPU time, 
although some time (10-20%) was saved by the reuse 
of intermediate values from earlier computations. See 
Table 1 for a comparison with previous computations. 
The check took only 24 hours, but the two times are 
not comparable since the arctangent computation was 
performed on a much faster computer, a Japanese Hi- 
tachi supercomputer with a speed of 630 MFLOPS 
(million floating-point operations per second). 

A forthcoming paper by Kanada contains a statistical 
analysis of the first ten million digits, which show no 
unusual deviation from expected behavior. The fre- 
quencies for each of the ten digits are: 999,440; 999,333; 
1,000,306; 999,964; 1,001,093; 1,000,466; 999,337; 
1,000,207; 999,814; and 1,000,040. Moreover, the speed 
with which the relative frequencies are approaching 
1/10 agrees with theory. Consider the digit 7 for ex- 
ample. Its relative frequencies in the first 10! digits 
(i = 1,..., 7) are 0, .08, .095, .097, .10025, .0998, 
1000207, which seem to be approaching 1/10 at the 
speed predicted by probability theory for random 
digits, namely at a speed approximately proportional 
to 1/V/n, The poker test is relevant to the question of 


normality in base ten, and Table 2 contains the fre- 
quencies of poker hands from the first ten million 
digits; there is no significant deviation from the ex- 
pected values. 

Writers over the years have been fond of mentioning 
that 20 decimals of 7 suffice for any application imag- 
inable. Moreover, the millions of digits now known 
shed absolutely no light on how to prove 7’s nor- 
mality. But these criticisms miss the point. Huyghens, 
in using an extrapolative techinque to extend Ar- 
chimedes’ calculations, was the first to use an impor- 
tant technique that, in this century, has come to be 
known as the Romberg method for approximating def- 
inite integrals. And the arithmetic-geometric mean al- 
gorithms and their refinements are closely connected 
to the fastest known techniques for evaluating multi- 
precision transcendental functions. Thus digit-hunting 
has an importance that goes beyond the mere exten- 
sion of the known decimal places of 1. 
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7. The computation of 7 to 29,360,000 decimal 
digits using Borweins’ quartically convergent 
algorithm (1988) 


Paper 7: David H. Bailey, “The computation of pi to 29,360,000 decimal digits 
using Borweins’ quartically convergent algorithm,” Mathematics of Computation, 
vol. 50 (1988), p. 283-296. Reprinted by permission of the American Mathematical 
Society. 


Synopsis: 
This paper, written by one of the present editors, describes the computation 

of 7 using a set of formulas that at the time (1988) had just been discovered by 

Jonathan and Peter Borwein: Let ag = 6 — 4/2 and yo = 2 —1. Iterate 

l=G-9)* 

4 92k+3 


Yk+1 = 


Yroi(1 t+ Yeo + Vivi): 


Then a, converges quartically to 1/m: each successive iteration approximately 
quadruples the number of correct digits in the result. 

Bailey also described in detail the computational techniques required to do such 
a long computation, such as the observation that a fast Fourier transform (FFT) 
can be employed to perform high-precision multiplication, and also presented the 
results of detailed statistical analyses of the digits. 

Interestingly, this computation, which was performed on one of the original 
Cray-2 supercomputers while in a test mode, disclosed at least one bug in the 
hardware, which was subsequently rectified. In the wake of this finding, Cray 
employed a similar calculation as a test code to be run on new computers to ensure 
hardware integrity. 


On41 = an(1+ yet) 
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The Computation of 7 to 29,360,000 Decimal Digits 
Using Borweins’ Quartically Convergent Algorithm 


By David H. Bailey 


Abstract. In a recent work [6], Borwein and Borwein derived a class of algorithms based 
on the theory of elliptic integrals that yield very rapidly convergent approximations to 
elementary constants. The author has implemented Borweins’ quartically convergent 
algorithm for 1/7, using a prime modulus transform multi-precision technique, to com- 
pute over 29,360,000 digits of the decimal expansion of #. The result was checked by 
using a different algorithm, also due to the Borweins, that converges quadratically to 
ma. These computations were performed as a system test of the Cray-2 operated by the 
Numerical Aerodynamical Simulation (NAS) Program at NASA Ames Research Center. 
The calculations were made possible by the very large memory of the Cray-2. 

Until recently, the largest computation of the decimal expansion of 7 was due to 
Kanada and Tamura [12] of the University of Tokyo. In 1983 they computed approxi- 
mately 16 million digits on a Hitachi 8-810 computer. Late in 1985 Gosper [9] reported 
computing 17 million digits using a Symbolics workstation. Since the computation de- 
scribed in this paper was performed, Kanada has reported extending the computation 
of + to over 134 million digits (January 1987). 

This paper describes the algorithms and techniques used in the author’s computation, 
both for converging to a and for performing the required multi-precision arithmetic. The 
results of statistical analyses of the computed decimal expansion are also included. 


1. Introduction. The computation of the numerical value of the constant 7 
has been pursued for centuries for a variety of reasons, both practical and theo- 
retical. Certainly, a value of 7 correct to 10 decimal places is sufficient for most 
“practical” applications, Occasionally, there is a need for double-precision or even 
multi-precision computations involving z and other elementary constants and func- 
tions in order to compensate for unusually severe numerical difficulties in an ex- 
tended computation. However, the author is not aware of even a single case of a 
“practical” scientific computation that requires the value of + to more than about 
100 decimal places. 

Beyond immediate practicality, the decimal expansion of 7 has been of interest 
to mathematicians, who have still not been able to resolve the question of whether 
the digits in the expansion of 7 are “random”. In particular, it is widely suspected 
that the decimal expansions of 7, e, /2, /27, and a host of related mathematical 
constants all have the property that the limiting frequency of any digit is one 
tenth, and that the limiting frequency of any n-long string of digits is 10-". Such 
a guaranteed property could, for instance, be the basis of a reliable pseudo-random 
number generator. Unfortunately, this assertion has not been proven in even one 
instance. Thus, there is a continuing interest in performing statistical analyses on 
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the decimal expansions of these numbers to see if there is any irregularity that 
would suggest this assertion is false. 

In recent years, the computation of the expansion of 7 has assumed the role as 
a standard test of computer integrity. If even one error occurs in the computation, 
then the result will almost certainly be completely in error after an initial correct 
section. On the other hand, if the result of the computation of 7 to even 100,000 
decimal places is correct, then the computer has performed billions of operations 
without error. For this reason, programs that compute the decimal expansion 
of m are frequently used by both manufacturers and purchasers of new computer 
equipment to certify system reliability. 


2. History. The first serious attempt to calculate an accurate value for the 
constant 7 was made by Archimedes, who approximated 7 by computing the areas 
of equilateral polygons with increasing numbers of sides. More recently, infinite 
series have been used. In 1671 Gregory discovered the arctangent series 

oe es dg? 
tan (j=t-Ste rat 
This discovery led to a number of rapidly convergent algorithms. In 1706 Machin 
used Gregory’s series coupled with the identity 


m = 16tan~1(1/5) — 4tan—! (1/239) 
to compute 100 digits of 7. 
In the nearly 300 years since that time, most. computations of the value of 7, 


even those performed by computer, have employed some variation of this technique. 
For instance, a series based on the identity 


nm = 24tan~'(1/8) + 8tan~!(1/57) + 4tan~! (1/239) 


was used in a computation of z to 100,000 decimal digits using an IBM 7090 in 
1961 [15]. Readers interested in the history of the computation 7 are referred to 
Beckmann’s entertaining book on the subject [2]. 


3. New Algorithms for 7. Only very recently have algorithms been discov- 
ered that are fundamentally faster than the above techniques. In 1976 Brent [7] 
and Salamin [14] independently discovered an approximation algorithm based on 
elliptic integrals that yields quadratic convergence to 7. With all of the previous 
techniques, the number of correct digits increases only linearly with the number 
of iterations performed. With this new algorithm, each additional iteration of the 
algorithm approximately doubles the number of correct digits. Kanada and Tamura 
employed this algorithm in 1983 to compute 7 to over 16 million decimal digits. 

More recently, J. M. Borwein and P. B. Borwein [4] discovered another quadrat- 
ically convergent algorithm for 7, together with similar algorithms for fast compu- 
tation of all the elementary functions. Their quadratically convergent algorithm 
for x can be stated as follows: Let a9 = V2, bp = 0, pp = 2+ V2. Iterate 


Qn +1/ fa @e(1+6 b l+a 
were (fan +1/Y re) hee Jar( m) Be +1 k+1) 
2 ay, + by 1+ be+1 
Then p; converges quadratically to 7: Successive iterations of this algorithm yield 
3, 8, 19, 40, 83, 170, 345, 694, 1392, and 2788 correct digits of the expansion of 7. 
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However, it should be noted that this algorithm is not self-correcting for numerical 
errors, so that all iterations must be performed to full precision. In other words, in 
a computation of 7 to 2788 decimal digits using the above algorithm, each of the 
ten iterations must be performed with more than 2788 digits of precision. 

Most recently, the Borweins [6] have discovered a genera] technique for obtaining 
even higher-order convergent algorithms for certain elementary constants. Their 
quartically convergent algorithm for 1/7 can be stated as follows: Let ag = 6—4,/2 
and yo = V2 — 1. Iterate 


Yr+1 = a 3 He et ug)” 
1+ (1 ~ yp)/4 
Oks = On(1 + yegr)*® — 27°39 (1+ year + Yer): 


Then a, converges quartically to 1/a: Each successive iteration approximately 
quadruples the number of correct digits in the result. As in the previous case, each 
iteration must be performed to at least the level of precision desired for the final 
result. 


4, Multi-Precision Arithmetic Techniques. A key element of a very high 
precision computation of this sort is a set of high-performance routines for perform- 
ing multi-precision arithmetic. A naive approach to multi-precision computation 
would require a prohibitive amount of processing time and would, as a result, 
sharply increase the probability that an undetected hardware error would occur, 
rendering the result invalid. In addition to employing advanced algorithms for such 
key operations as multi-precision multiplication, it is imperative that these algo- 
rithms be implemented in a style that is conducive for high-speed computation on 
the computer being used. 

The computer used for these computations is the Cray-2 at the NASA Ames 
Research Center. This computation was performed to test the integrity of the 
Cray-2 hardware, as well as the Fortran compiler and the operating system. The 
Cray-2 is particularly well suited for this computation because of its very large 
main memory, which holds 228 = 268, 435, 456 words (one word is 64 bits of data). 
With this huge capacity, all data for these computations can be contained entirely 
within main memory, insuring ease of programming and fast execution. 

No attempt was made to employ more than one of the four central processing 
units in the Cray-2. Thus, at the same time these calculations were being per- 
formed, the computer was executing other jobs on the other processors. However, 
full advantage was taken of the vector operations and vector registers of the system. 
Considerable care was taken in programming to insure that the multi-precision al- 
gorithms were implemented in a style that would admit vector processing. Most key 
loops were automatically vectorized by the Cray-2 Fortran compiler. For those few 
that were not automatically vectorized, compiler directives were inserted to force 
vectorization. As a result of this effort, virtually all arithmetic operations were 
performed in vector mode, which on the Cray-2 is approximately 20 times faster 
than scalar mode. Because of the high level of vectorization that was achieved using 
the Fortran compiler, it was not necessary to use assembly language, nonstandard 
constructs, or library subroutines. 
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A multi-precision number is represented in these computations as an (n + 2)- 
long array of floating-point whole numbers. The first cell contains the sign of the 
number, either 1, —-1, or 0 (reserved for an exact zero). The second cell of the array 
contains the exponent (powers of the radix), and the remaining n cells contain the 
mantissa. The radix selected for the multi-precision numbers is 10’. Thus the 
number 1.23456789 is represented by the array 1,0, 1, 2345678, 9000000, 0,0,...,0. 

A floating-point representation was chosen instead of an integer representation 
because the hardware of numerical supercomputers such as the Cray-2 is designed 
for floating-point computation. Indeed, the Cray-2 does not even have full-word 
integer multiply or divide hardware instructions. Such operations are performed by 
first converting the operands to floating-point form, using the floating-point unit, 
and converting the results back to fixed-point (integer) form. A decimal radix was 
chosen instead of a binary value because multiplications and divisions by powers of 
two are not performed any faster than normal on the Cary-2 (in vector mode). Since 
a decimal radix is clearly preferable to a binary radix for program troubleshooting 
and for input and output, a decimal radix was chosen. The value 10’ was chosen 
because it is the largest. power of ten that will fit in half of the mantissa of a single 
word. In this way two of these numbers may be multiplied to obtain the exact, 
product using ordinary single-precision arithmetic. 

Multi-precision addition and subtraction are not computationally expensive com- 
pared to multiplication, division, and square root extraction. Thus, simple algo- 
rithms suffice to perform addition and subtraction. The only part of these opera- 
tions that is not immediately conducive to vector processing is releasing the carries 
for the final result. This is because the normal “schoolboy” approach of beginning 
at the last cell and working forward is a recursive operation. On a vector super- 
computer this is better done by starting at the beginning and releasing the carry 
only one cell back for each cell processed. Unfortunately, it cannot be guaranteed 
that one application of this process will release all carries (consider the case of two 
or more consecutive 9999999’s, followed by a number exceeding 107). Thus it is 
necessary to repeat this operation until all carries have been released (usually one 
or two additional times). In the rare cases where three applications of this vector- 
ized process are not successful in releasing all carries, the author’s program resorts 
to the scalar “schoolboy” method. 

Provided a fast multi-precision multiplication procedure is available, multi- 
precision division and square root extraction may be performed economically us- 
ing Newton's iteration, as follows. Let 29 and yo be initial approximations to the 
reciprocal of a and to the reciprocal of the square root of a, respectively. Then 
_ ye(3 — ayp) 

2 
both converge quadratically to the desired values. One additional full-precision mul- 
tiplication yields the quotient and the square root, respectively. What is especially 
attractive about these algorithms is that the first iteration may be performed using 
ordinary single-precision arithmetic, and subsequent iterations may be performed 
using a level of precision that approximately doubles each time. Thus the total 
cost of computation is only about twice the cost of the final iteration, plus the one 
additional multiplication. As a result, a multi-precision division costs only about 


Te+1 = ZR(2— arr), Yk+1 
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five times as much as a multi-precision multiplication, and a multi-precision square 
root costs only about seven times as much as a multi-precision multiplication. 


§. Multi-Precision Multiplication. It can be seen from the above that the 
key component of a high-performance multi-precision arithmetic system is the mul- 
tiply operation. For modest levels of precision (fewer than about 1000 digits), some 
variation of the usual “schoolboy” method is sufficient, although care must be taken 
in the implementation to insure that the operations are vectorizable. Above this 
level of precision, however, other more sophisticated techniques have a significant 
advantage. The history of the development of high-performance multiply algo- 
rithms will not be reviewed here. The interested reader is referred to Knuth (13]. 
It will suffice to note that all of the current state-of-the-art techniques derive from 
the following fact of Fourier analysis: Let F(x) denote the discrete Fourier trans- 
form of the sequence z = (z9,21,22,...,%n—1), and let F—+(x) denote the inverse 
discrete Fourier transform of 2: 


N-1 1 N-1 ; 
F(a) = So ajo, FO @) = aw, 
7=0 g=8 


where w = e~?7'/% is a primitive Nth root of unity. Let C(z, y) denote the convo- 


lution of the sequences x and y: 
N-1 
Cr(z,y) = D> 2ye-35 
= 


where the subscript &—7 is to be interpreted as k-—j7 +N if k~7 is negative. Then 
the “convolution theorem”, whose proof is a straightforward exercise, states that 


F(C(z,y)] = F(x) F(y), 
or expressed another way, 
C(z,y) = F~"[F(2)F(y)]. 


This result is applicable to multi-precision multiplication in the following way. 
Let z and y be n-long representations of two multi-precision numbers (without the 
sign or exponent. words). Extend z and y to length 2n by appending n zeros at the 
end of each. Then the multi-precision product z of z and y, except for releasing 
the carries, can be written as follows: 

20 = to¥o 
21 = Zoy1 + T1140 


22 = Toye + F141 + Layo 
2n—-1 = LoYn-1 + L1Yn-2 + °°* + In-1Yo 


22n-3 = In—-1Yn-2 + In-2Yn-1 
2Qn-2 = In-1Yn-1 


Zan-1 = 0. 
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It can now be seen that this “multiplication pyramid” is precisely the convolution 
of the two sequences z and y, where NV = 2n. The convolution theorem states that 
the multiplication pyramid can be obtained by performing two forward discrete 
Fourier transforms, one vector complex multiplication, and one reverse transform, 
each of length N = 2n. Once the resulting complex numbers have been rounded 
to the nearest integer, the final multi-precision product may be obtained by merely 
releasing the carries as described in the section above on addition and subtraction. 

The key computational savings here is that the discrete Fourier transform may 
of course be economically computed using some variation of the “fast Fourier trans- 
form” (FFT) algorithm. It is most convenient to employ the radix two fast Fourier 
transform since there is a wealth of literature on how to efficiently implement this 
algorithm (see [1], [8], and [16]). Thus, it will be assumed from this point that 
N = 2™ for some integer m. 

One useful “trick” can be employed to further reduce the computational require- 
ment for complex transforms. Note that the input data vectors z and y and the 
result vector z are purely real. This fact can be exploited by using a simple pro- 
cedure ([8, p. 169]) for performing real-to-complex and complex-to-real transforms 
that obtains the result with only about half the work otherwise required. 

One important item has been omitted from the above discussion. If the radix 107 
is used, then the product of two cells will be in the neighborhood of 10!*, and the 
sum of a large number of these products cannot be represented exactly in the 48- 
bit mantissa of a Cray-2 floating-point word. In this case the rounding operation 
at the completion of the transform will not be able to recover the exact whole 
number result. As a result, for the complex transform method to work correctly, it 
is necessary to alter the above scheme slightly. The simplest solution is to use the 
radix 10° and to divide all input data into two words with only three digits each. 
Although this scheme greatly increases the memory space required, it does permit 
the complex transform method to be used for multi-precision computation up to 
several million digits on the Cray-2. 


6. Prime Modulus Transforms. Some variation of the above method has 
been used in almost all high-performance multi-precision computer programs, in- 
cluding the program used by Kanada and Tamura. However, it appears to break 
down for very high-precision computation (beyond about ten million digits on the 
Cray-2), due to the round-off error problem mentioned above. The input data can 
be further divided into two digits per word or even one digit per word, but only with 
a substantial increase in run time and main memory. Since a principal goal in this 
computation was to remain totally within the Cray-2 main memory, a somewhat 
different method was used. 

It can readily be seen that the technique of the previous section, including the 
usage of a fast. Fourier transform algorithm, can be applied in any number field in 
which there exists a primitive Nth root of unity w. This requirement holds for the 
field of the integers modulo p, where p is a prime of the form p = kN + 1 ((11, p. 
85]). One significant advantage of using a prime modulus field instead of the field 
of complex numbers is that there is no need to worry about round-off error in the 
results, since all computations are exact. 
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However, there are some difficulties in using a prime modulus field for the trans- 
form operations above. The first is to find a prime p of the form kN + 1, where 
N =2™. The second is to find a primitive Nth root of unity modulo p. As it turns 
out, it is not too hard using a computer to find both of these numbers by direct 
search. Thirdly, one must compute the multiplicative inverse of N modulo p. This 
can be done using a variation of the Euclidean algorithm from elementary number 
theory. Note that each of these calculations needs to be performed one time only. 

A more troublesome difficulty in using a prime modulus transform is the fact 
that the final multiplication pyramid results are only recovered modulo p. If p is 
greater than about 104 then this is not a problem, but the usage of such a large 
prime would require quadruple-precision arithmetic operations to be performed in 
the inner loop of the fast Fourier transform, which would very greatly increase the 
run time. A simpler and faster approach to the problem is to use two primes, 71 
and p2, each slightly greater than 10!*, and to perform the transform algorithm 
above using each prime. Then the Chinese remainder theorem may be applied to 
the results modulo p; and pe to obtain the results modulo the product pipe. Since 
Pp p2 is greater than 1024, these results will be the exact multiplication pyramid 
numbers. Unfortunately, double-precision arithmetic must still be performed in the 
fast Fourier transform and in the Chinese remainder theorem calculation. However, 
the whole-number format of the input data simplifies these operations, and it is 
possible to program them in a vectorizable fashion. 

Borodin and Munro (([3, p. 90]) have suggested using three transforms with three 
primes p1,p2 and p3, each of which is just smaller than half of the mantissa, and 
using the Chinese remainder theorem to recover the results modulo pipop3. In this 
way, double-precision operations are completely avoided in the inner loop of the 
FFT. This scheme runs quite fast, but unfortunately the largest transform that can 
be performed on the Cray-2 using this system is N = 21°, which corresponds to a 
maximum precision of about three million digits. 

Readers interested in studying about prime modulus number fields, the Euclidean 
algorithm, or the Chinese remainder theorem are referred to any elementary text 
on number theory, such as [10] or [11]. Knuth [13] and Borodin [3] also provide 
excellent information on using these tools for computation. 


7. Computational Results. The author has implemented all three of the 
above techniques for multi-precision multiplication on the Cray-2. By employing 
special high-performance techniques [1], the complex transform can be made to run 
the fastest, about four times faster than the two-prime transform method. However, 
the memory requirement of the two-prime scheme is significantly less than either 
the three-prime or the complex scheme, and since the two-prime scheme permits 
very high-precision computation, it was selected for the computations of 7. 

One of the author’s computations used twelve iterations of Borweins’ quartic 
algorithm for 1/7, followed by a reciprocal operation, to yield 29,360,128 digits 
of x. In this computation, approximately 12 trillion arithmetic operations were 
performed. The run took 28 hours of processing time on one of the four Cray-2 
central processing units and used 138 million words of main memory. It was started 
on January 7, 1986 and completed January 9, 1986. The program was not running 
this entire time—the system was taken down for service several times, and the run 
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was frequently interrupted by other programs. Restarting the computation after a 
system down was a simple matter since the two key multi-precision number arrays 
were saved on disk after the completion of each iteration. 

This computation was checked using 24 iterations of Borweins’ quadratically 
convergent algorithm for 7. This run took 40 hours processing time and 147 million 
words of main memory. A comparison of these output results with the first run 
found no discrepancies except for the last 24 digits, a normal truncation error. 
Thus it can be safely assumed that at least 29,360,000 digits of the final result are 
correct. 

It was discovered after both computations were completed that one loop in the 
Chinese remainder theorem computation was inadvertently performed in scalar 
mode instead of vector mode. As a result, both of these calculations used about 
25% more run time than would otherwise have been required. This error, however, 
did not affect the validity of the computed decimal] expansions. 


8. Statistical Analysis of 7. Probably the most significant mathematical 
motivation for the computation of 7, both historically and in modern times, has 
been to investigate the question of the randomness of its decimal expansion. Before 
Lambert proved in 1766 that 7 is irrational, there was great interest in checking 
whether or not its decimal expansion eventually repeats, thus disclosing that 7 is 
rational. Since that time there has been a continuing interest in the still unan- 
swered question of whether the expansion is statistically random. It is of course 
strongly suspected that the decimal expansion of 7, if computed to sufficiently high 
precision, will pass any reasonable statistical test for randomness. The most fre- 
quently mentioned conjecture along this line is that any sequence of n digits occurs 
with a limiting frequency of 107”. 

With 29,360,000 digits, the frequencies of n-long strings may be studied for 
randomness for n as high as six. Beyond that level the expected number of any 
one string is too low for statistical tests to be meaningful. The results of tabulated 
frequencies for one and two digit strings are listed in Tables 1 and 2. In the first 
table the Z-score numbers are computed as the deviation from the mean divided 
by the standard deviation, and thus these statistics should be normally distributed 
with mean zero and variance one. 


TABLE 1 
Single digit statistics 
Digit Count Deviation Z-score 
0 2935072 — 928 — 0.5709 
1 2936516 516 0.3174 
2 2936843 843 0.5186 
3 2935205 — 795 — 0.4891 
4 2938787 2787 1.7145 
5 2936197 197 0.1212 
6 2935504 — 496 — 0.3051 
7 2934083 — 1917 — 1.1793 
8 2935698 — 302 — 0.1858 
9 2936095 95 0.0584 
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00 
05 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
79 
80 
85 
90 
95 
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293062 
294189 
294503 
293158 
293952 
293721 
293718 
293490 
294622 
293998 
292736 
294194 
293842 
293544 
292650 
293199 
292517 
293600 
293470 
293104 


01 
06 
11 
16 
21 
26 
31 
36 
41 
46 
51 
56 
61 
66 
71 
76 
81 
86 
91 
96 


TABLE 2 


Two digit frequency counts 


293970 
292688 
293409 
293799 
293226 
293655 
293542 
293484 
294793 
294418 
294272 
293260 
293105 
293123 
294304 
293597 
292986 
293786 
292908 
293694 


02 
07 
12 
17 
22 
27 
32 
37 
42 
47 
52 
57 
62 


293533 
292707 
293591 
293020 
293844 
293969 
293272 
292694 
293863 
293616 
293614 
294152 
294187 
293307 
293497 
292745 
293637 
293971 
293806 
293902 


03 
08 
13 
18 
23 
28 
33 
38 
43 
48 
53 
58 
63 
68 
73 
78 
83 
88 
93 
98 


292893 
294260 
294285 
293262 
293382 
293320 
293422 
294152 
293041 
293296 
293215 
293137 
293809 
293602 
293761 
293223 
294475 
293434 
292922 
294012 


04 
09 
14 
19 
24 
29 
34 
39 
44 
49 
54 
59 
64 
69 
74 
79 
84 
89 
94 
99 


294459 
293311 
294020 
293469 
293869 
293905 
293178 
294253 
293519 
293621 
293569 
294048 
293463 
293522 
293960 
293147 
294267 
293025 
294483 
293794 
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The most appropriate statistical procedure for testing the hypothesis that the 
empirical frequencies of n-long strings of digits are random is the x? test. The x? 
statistic of the k observations X,,X2,...,X% is defined as 


gre > (X ue 
a % 


t=1 


where £; is the expected value of the random variable X;. In this case k = 10" and 
E,; = 107d for all 7, where d = 29, 360, 000 denotes the number of digits. The mean 
of the x? statistic in this case is k — 1 and its standard deviation is \/2(k — 1). Its 
distribution is nearly normal for large k. The results of the x? analysis are shown 


in Table 3. 


Length 


1 
2 
3 
4 
5 


6 


TABLE 3 


x? value 
4.869696 
84.52604 
983.9108 
10147.258 
100257.92 
1000827.7 
Another test that is frequently performed on long pseudo-random sequences is 
an analysis to check whether the number of n-long repeats for various n is within 
statistical bounds of randomness. An n-long repeat is said to occur if the n-long 


Multiple digit x? statistics 


Z-score 
— 0.9735 
— 1.0286 
— 0.3376 

1.0484 
0.5790 
0.5860 
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digit sequence beginning at two different positions is the same. The mean M and 
the variance V of the number of n-long repeats in d digits are (to an excellent 
approximation) 
10-"d? 11-10~-"d? 
M= a V= — 

Tabulation of repeats in the expansion of + was performed by packing the string 
beginning at each position into a single Cray-2 word, sorting the resulting array, 
and counting equal contiguous entries in the sorted list. The results of this analysis 
are shown in Table 4. 


TABLE 4 
Long repeat statistics 


10 42945 43100. — 0.677 
11 4385 4310. 1.033 
12 447 431. 0.697 
13 48 43.1 0.675 
14 6 4.31 0.736 
15 1 0.43 0.784 


A third test frequently performed as a test for randomness is the runs test. This 
test compares the observed frequency of long runs of a single digit with the number 
of such occurrences that would be expected at random. The mean and variance of 
this statistic are the same as the formulas for repeats, except that d? is replaced by 
2d. Table 5 lists the observed frequencies of runs for the calculated expansion of 7. 

The frequencies of long runs are all within acceptable limits of randomness. The 
only phenomenon of any note in Table 5 is the occurrence of a 9-long run of sevens. 
However, there is a 29% chance that a 9-long run of some digit would occur in 
29,360,000 digits, so this instance by itself is not remarkable. 

TABLE 5 
Single-digit run counts 
Length of Run 
Digit 5 6 7 

0 308 29 
281 21 
272 23 
266 26 
296 40 
292 30 
316 33 
315 37 
295 36 
306 40 


9. Conclusion. The statistical analyses that have been performed on the ex- 
pansion of 7 to 29,360,000 decimal places have not disclosed any irregularity. The 
observed frequencies of n-long strings of digits for m up to 6 are entirely unremark- 
able. The numbers of long repeating strings and single-digit r-ns are completely 
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acceptable. Thus, based on these tests, the decimal expansion of 7 appears to be 
completely random. 


Appendix 


Selected Output Listing 


Initial 1000 digits: 


3k. 
14159265358979323846264338327950288419716939937510 
58209749445923078164062862089986280348253421170679 
82148086513282306647093844609550582231725359408128 
48111745028 410270193852110555964462294895493038196 
44288109756659334461284756482337867831652712019091 
45648566923460348610454326648213393607260249141273 
72458700660631558817488152092096282925409171536436 
78925903600113305305488204665213841469519415116094 
33057270365759591953092186117381932611793105118548 
0744623799627495673518857527248912279 3818301194912 
98336733624406566430860213949463952247371907021798 
60943702770539217176293176752384674818467669405132 
00056812714526356082778577134275778960917363717872 
14684409012249534301465495853710507922796892589235 
42019956112129021960864034418159813629774771309960 
51870721134999999837297804995105973173281609631859 
50244594553 469083026425223082533446850352619311881 
71010003137838752886587533208381420617177669147303 
5982534904287554687 3115956286 388235378759375195778 
18577805321712268066130019278766111959092164201989 


Digits 4,999,001 to 5,000,000: 


49480754784558100182731931632488412804488722296956 
79855015464855780486736535227902836997918084867230 
64962221004527085768335035212069684801817137616329 
97561738425160340472537100056351640342162492027179 
66824926458930960182645026923102266570541641475347 
20341554913770421505764452807809035248393621093031 
02288096238486877923145240841637271180953058890040 
68843766781431498914299893621278545260143140439048 
49938801556336059513116731891132765777881364690708 
47036863411196323063886507480852125682842257852524 
03086993703255692093960818587414181230484153204049 
20234989002732447593020323794790776444752398445514 
67304403210968985244961967143433964895893190552338 
49818852746844924836314634250006421630628686858848 
27453318669926734730642735036364002856022218966350 
11429182634319974163253372368798553451111253055262 
39104082639970934508146672521381105913047210052428 
18988626533169469331951675296209306752291590715999 
898 46179288059262000848638138811280944056488021060 
48865855191846702365421761783505181721320764619715 
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Digits 9,999,001 to 10,000,000: 


55097818243516728227849910720400286757907904466335 
12718202979525150617725334066894988956424703269230 
15399820900390166275224338184424808589395293652582 
53635658584175485536744818650289245188206447853280 
79129675504865572929083083485483937583334671019089 
12067114536955173140929461823466478725289529974204 
02127635235923293305770179423865225963240694027480 
60412880303092452481034941582735932443887273109397 
41634889604695819245395151341043433998381874650972 
33692635225791472454244401326312964396391209607800 
16344851199125420819737407446045899742145731042313 
64456 48650193780106 352660374405656882 3861389375443 
97351681296831567911618884222251141477322612331396 
18606080373110348692660933940438416300326143449280 
50821131575737727739821551522286509997662432587213 
93393445902091662272905493493827178205126669021149 
47192311380933822311224099588372246332501222323378 
96895269025366263941267010317327864987170257149617 
76105155492579857592045532468944687427025046397905 
65326553194060999469787333810631719481735348955897 


Digits 14,999,001 to 15,000,000: 


75161912582729034437123279749256311511925243956985 
41466735069194815163837226073925151887751751659741 
00622880726 448602209456930414488539882981108512492 
30626088375966783621649753412539683084922711342513 
94953995693625441331401738133085848172315887473225 
66862139251938540102249475575494947158395623512785 
67033888824495551084462300472407612165952784386252 
83059992 302223284865934566 2629297484 36827730812030 
14434593689874259766415514412097984133998015934584 
35393475650624323850160432731918805126406671871353 
77555766214670931813151162879500509710551795152818 
09093154481058044767364122166100032425098263166257 
41730518220480715488224616563891344046934208103238 
39903254029881746342496583186836947486194257533540 
36331223838222392494056270856378033056213544686593 
02986821714952808585949418676532291067339817684850 
77576151785057277099880627370814385794117668763599 
75814499149890314594098525960336377989988228138579 
03954608500076180754880433958468619641092762653446 
79645205263473393286074979 3239 31503141172775669803 
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Digits 19,999,001 to 20,000,000: 


24662421652199659486815804456870197576438951607697 
86758526528445124126249995515004465281646092893016 
37396198596248627116552 469686 381679679898926165214 
19985145392716546108714664257998278750239431446690 
24524827883001435830699295155565194378002452231513 
03498450165135282534109758167508041457187906821950 
98156889669401540575560430489547131781464796920586 
99611799897126 3887365315643 45333853581593559913668 
62608486 227029865668230856 391322081859205243349223 
41898466479821052634622968628766495150696262416056 
24275201300452308788083860012754008114751496913646 
62422297630443481605116791864334302662386921297850 
27885235888942133721123400642720173755448172632485 
38990548569368292370090889371435442648824207842546 
28067400727949203553263884395310176843535902614634 
76307233029969045465206192626213143248919480318684 
24091340888618503237670440877047193079665717842568 
49026897445701681738816789861189706430445720674936 
81903857815020793 4661566 44931359073005891342758785 
95072447895232808191116291055801380049338634527644 


Digits 24,999,001 to 25,000,000: 


6 4626376657788 401626872035835150250932381126804132 
24527774629670113871130617683224437149346115597163 
91099108362268853888484703799982396604187954247350 
366358595213045168727098096789 48655853409228442863 
249489 36001342207955968740967092110719683856558205 
30816048151902240856062148774123551023529985810792 
74189214723685203602121713995138514107079374902532 
54350785997288413483911434952219864948321330490074 
60146435121254311259573947301142531184570914224080 
72612210306331872567179327168155609249989038137333 
66960257521334843154895361888436208731274888674781 
18373984739313750077149269011462219615798047067514 
35050981335283641909759090614464729227662129370246 
47057090874450108027231969863517024941726518038367 
3276289174186 3822149208539226376382907305941739639 
075495888658 49168186491743776278287261919660505923 
924757388 36587226649359524383297861404378228288281 
73596312642574370611956801297356036342637793562761 
38037507909491563108238168922672241753290045253446 
07864115924597806944245511285225546774836191884322 
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Digits 29,359,001 to 29,360,000: 


34192841788915229643368473881977698539005746219846 
69525347577001729886543392436261840972591968259157 
61107476294007303074005235627829787025544075405543 
99895071530598162189611315050419697309728290606067 
18890116138206842589980215445395753593792898823575 
01412347486672046935635735777380648437308573291840 
62108496 330974827689411268675222975523230623956833 
6263114891606 3883977661973091499155192847894109691 
39612265329351195978725566 764256 462895375180907449 
49 363092921314127640888510170422584084744149319118 
65755825721772836144977978766052285469047197596264 
76680055360842209689517737135008611890452433015212 
37693745702070338988940123376693961057269535278146 
99719136307074643201853864071307997507974509883554 
65961575782849747512645786441130845325323149405419 
1726336 4899647912032878171893387317819324912382342 
18648271763723022561720016348368584955658165112489 
95446848720693621957797943429494640258 419939089135 
34266985232776239314365259670832026370250924776814 
70490971424493675414330987259507806654322272888253 
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Paper 8: Gert Almkvist and Bruce Berndt, “Gauss, Landen, Ramanujan, the 
arithmetic-geometric mean, ellipses, pi, and the Ladies Diary,” American Math- 
ematical Monthly, vol. 95 (1988), pg. 585-608. Copyright 1988 Mathematical 
Association of America. All Rights Reserved. 


Synopsis: 

If the title of this article seems wide-ranging, it is because the article itself is, 
delightfully so. The authors take the reader on an entertaining but highly informa- 
tive tour of the arithmetic-geometric mean (AGM), including the historical roots 
of Gauss’s work, a mini-biography of the reclusive 19th century British mathemati- 
cian John Landen, who wrote articles on mathematics (including on the AGM) 
for the Ladies Diary, a popular womens magazine that included a mathematics 
column, and late-20th-century developments including the new quadratically con- 
vergent algorithms of Salamin, Brent and others, and computations implementing 
these techniques on powerful computer systems. All of this is presented in a very 
rigorous style that exposes both the methods and results of the theory. 

Almkvist and Bernt also discuss some extensions of this theory that were un- 
dertaken by the famed Indian mathematician Ramanujan. The authors were well- 
equipped to present this material, since Berndt, for instance, was in the process 
of editing the works of Ramanujan for a modern audience. The result is a very 
enlightening mathematical tour of the AGM. 


Keywords: Algorithms, Arithmetic-Geometric Mean, Computation, Elliptic Inte- 
grals, History 
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hitherto unproven results in Ramanujan’s notebooks. His book, Ramanujan’s 
Notebooks, Part I (Springer-Verlag, 1985), is the first of either three or four 
volumes to be published on this project. 


Virtue and sense, with female-softness join’d 

(All that subdues and captivates mankind!) 

In Britain’s matchless fair resplendent shine; 
They rule Love’s empire by a right divine: 

Justly their charms the astonished World admires, 
Whom Royal Charlotte’s bright example fires. 


1. Introduction. The arithmetic-geometric mean was first discovered by Lagrange 
and rediscovered by Gauss a few years later while he was a teenager. However, 
Gauss’s major contributions, including an elegant integral representation, were 
made about 7-9 years later. The first purpose of this article is, then, to explain the 
arithmetic-geometric mean and to describe some of its major properties, many of 
which are due to Gauss. 


*Research partially supported by the Vaughn Foundation. 
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Because of its rapid convergence, the arithmetic-geometric mean has been signifi- 
cantly employed in the past decade in fast machine computation. A second purpose 
of this article is thus to delineate its role in the computation of 7. We emphasize 
that the arithmetic-geometric mean has much broader applications, e.g., to the 
calculation of elementary functions such as log x, e*, sin x, and cos x. The inter- 
ested reader should further consult the several references cited here, especially 
Brent’s paper [14] and the Borweins’ book [13]. 

The determination of the arithmetic-geometric mean is intimately related to the 
calculation of the perimeter of an ellipse. Since the days of Kepler and Euler, several 
approximate formulas have been devised to calculate the perimeter. The primary 
motivation in deriving such approximations was evidently the desire to accurately 
calculate the elliptical orbits of planets. A third purpose of this article is thus to 
describe the connections between the arithmetic-geometric mean and the perimeter 
of an ellipse, and to survey many of the approximate formulas that have been given 
in the literature. The most accurate of these is due to Ramanujan, who also found 
some extraordinarily unusual and exotic approximations to elliptical perimeters. 
The latter results are found in his notebooks and have never been published, and so 
we shall pay particular attention to these approximations. 

Also contributing to this circle of ideas is the English mathematician John 
Landen. In the study of both the arithmetic-geometric mean and the determination 
of elliptical perimeters, there arises his most important mathematical contribution, 
which is now called Landen’s transformation. Many very important and seemingly 
unrelated guises of Landen’s transformation exist in the literature. Thus, a fourth 
purpose of this article is to delineate several formulations of Landen’s transforma- 
tion as well as to provide a short biography of this undeservedly, rather obscure, 
mathematician. 

For several years, Landen published almost exclusively in the Ladies Diary. This 
is, historically, the first regularly published periodical to contain a section devoted 
to the posing of mathematical problems and their solutions. Because an important 
feature of the MONTHLY has its roots in the Ladies Diary, it seems then dually 
appropriate in this paper to provide a brief description of the Ladies Diary. 


2. Gauss and the arithmetic-geometric mean. As we previously alluded, the 
arithmetic-geometric mean was first set forth in a memoir of Lagrange [30] pub- 
lished in 1784-85. However, in a letter, dated April 16, 1816, to a friend, H. C. 
Schumacher, Gauss confided that he independently discovered the arithmetic-geo- 
metric mean in 1791 at the age of 14. At about the age of 22 or 23, Gauss wrote a 
long paper [23] describing his many discoveries on the arithmetic-geometric mean. 
However, this work, like many others by Gauss, was not published until after his 
death. Gauss’s fundamental paper thus did not appear until 1866 when E. Schering, 
the editor of Gauss’s complete works, published the paper as part of Gauss’s 
Nachlass. Gauss obviously attached considerable importance to his findings on the 
arithmetic-geometric mean, for several of the entries in his diary, in particular, from 
the years 1799 to 1800, pertain to the arithmetic-geometric mean. Some of these 
entries are quite vague, and we may still not know everything that Gauss discovered 
about the arithmetic-geometric mean. (For an English translation of Gauss’s diary 
together with commentary, see a paper by J. J. Gray [24].) 

By now, the reader is anxious to learn about the arithmetic-geometric mean and 
what the young Gauss discovered. 
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Let a and b denote positive numbers with a > b. Construct a sequence of 
arithmetic means and a sequence of geometric means as follows: 


1 
a, = (a + b), b, = Vab, 


1 
a,= Pac + b,), b, = ya,b,, 


1 
Gna = 3 (4n F b,), are = ja,b ? 


Gauss [23] gives four numerical examples, of which we reproduce one. Let a = 1 
and b = 0.8. Then 


a, = 0.9, b, = 0.894427190999915878564, 
a, = 0.897213595499957939282, b, = 0.897209268732734, 
a, = 0.897211432116346, b, = 0.897211432113738, 
a, = 0.897211432115042, b, = 0.897211432115042. 


(Obviously, Gauss did not shirk from numerical calculations.) It appears from this 
example that {a,} and {b,} converge to the same limit, and that furthermore this 
convergence is very rapid. This we now demonstrate. 

Observe that — 


b<b, <a, <a, 
b<b, <b, <a,<a, <a, 
b<b, <b, <by<a,<a,<a, <a, 
etc. Thus, {b,} is increasing and bounded, and {a,,} is decreasing and bounded. 


Each sequence therefore converges. Elementary algebraic manipulation now shows 
that 


a,—b, a-—b a-b 
a-b 4(a,+b,) 2(a+b)+ 4b, 


1 
<s. 

2 
Iterating this procedure, we deduce that 

1 n 
a,~6,<(5| (a—b),. ne, 

which tends to 0 as n tends to oo. Thus, a, and b, converge to the same limit, 
which we denote by M(a, b). By definition, M(a, b) is the arithmetic-geometric 


mean of a and b. 
To provide a more quantitative measure of the rapidity of convergence, first 


define 
C, = yan — be, n>0, (1) 
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where a, = a and by = b. Observe that 
2 


Cn 


cn 


7 < — 
4a,.,  4M(a,b) 


Thus, c, tends to 0 quadratically, or the convergence is of the second order. More 
generally, suppose that {a,,} converges to L and assume that there exist constants 
C > 0 and m > 1 such that 


1 
Cnti = (an as b,) 


[Qn4,—- L| < Cla, — L|”, n>. 


Then we say that the convergence is of the mth order. 
Perhaps the most significant theorem in Gauss’s paper [23] is the following 
representation for M for which we provide Gauss’s ingenious proof. 


THEOREM 1. Let |x| < 1, and define 
K(x) = fa _ x?sin’p) dg. (2) 
0 
Then 


M(L +%,1~2)= 5275. 


The integral K(x) is called the complete elliptic integral of the first kind. 


Observe that in the definition of K(x), sin’p may be replaced by cos’. 
Before proving Theorem 1, we give a reformulation of it. Define 


I(a,b) = [(a@eos’e + b’sin’p) |” dy. (3) 
0 
It is easy to see that 
1 
I(a,b) = —K(x), 
a 


where 


Since 
M(a,b) = M(a,,b,) and M(ca,cb) = cM(a, b), (4) 


for any constant c, it follows that, with x as above, 
1 
M(1+x,1-—.x)= GMa, b). 


The following reformulation of Theorem 1 is now immediate. 


THEOREM 1’. Leta > b> 0. Then 


M(a, b) ora 21(a, b)’ 
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Proof. Clearly, M(1 + x,1— x) is an even function of x. Gauss then assumes 
that 


1 foe) 
—— SA, x, 5 
M(1+x,1- <x) x Kx (5) 


Now make the sat x = 2t/(1 + 1”). From (4), it follows that 


M(1+x,1-x)= 7G sM((1+1t)°,(1~1#)’) = 


are *_ (i +0,1-?). 


Substituting in (5), we nee that 


+2) A= Lala i 


k=0 


Clearly, Ay = 1. Expanding (1 + 17)~?*~1, k > 0, in a binomial series and equating 
coefficients of like powers of ¢ on both sides, we eventually find that 


1 ponte ee Hed Vega | BES aes 
ee — “+ — ene 
M(1 + x,1—x) (5) * ari Fa) * 


Bl : 
AZ ie 


x k 2k 
= Ks 
=o (k!) 
Here we have introduced the notation 
(a), =a(at+1)(a+2)---(at+k-1). (7) 


Complete details for the derivation of (6) may be found in Gauss’s paper [23, pp. 
367-369]. 

We now must identify the series in (6) with K(x). Expanding the integrand of 
K(x) in a binomial series and integrating termwise, we find that 


K(x) 


Ml 


(8) 


Combining (6) and (8), we complete the proof of Gauss’s theorem. 


Another short, elegant proof of Theorem 1 has been given by Newman [39] and is 
sketched by J. M. and P. B. Borwein [10]. 

For a very readable, excellent account of Gauss’s many contributions to the 
arithmetic-geometric mean, see Cox’s paper [18]. We shall continue the discussion of 
some of Gauss’s discoveries in Section 5. 


3. Landen and the Ladies Diary. We next sketch another proof of Theorem 1 (or 
Theorem 1’) which is essentially due to the eighteenth-century English mathemati- 
cian John Landen. 
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Second proof. Although the basic idea is due to Landen, the iterative procedure 
that we shall describe is apparently due to Legendre [33, pp. 79-83] some years 
later. 

For brevity, set 

Xn = Cp/ Ons n> 0, (9) 


where c, is defined by (1). In the complete elliptic integral of the first kind (2), 
make the substitution 

sin(2¢) 
x, + cos(2@) © 


(10) 


This is called Landen’s transformation. After a considerable amount of work, we 
find that 


tang, = 


K(x) = (1 + x,) K(x). 
Upon n iterations, we deduce that 
K(x) = (1+ x,)(1 + x,)---(1+x,)K(x,). (11) 
Since, by (1) and (9), 1 + x, = a,_,/a,, k 2 1, we see that (11) reduces to 


K(x) = —K(x,). 


We now let n tend to oo. Since a, tends to M(a, b) and x,, tends to 0, we conclude 
that 


a am 
M(a, b) 2M(a,b)- 


Landen’s transformation (10) was introduced by him in a paper [31] published in 
1771 and in more developed form in his most famous paper [32] published in 1775. 
There exist several versions of Landen’s transformation. Often Landen’s transfor- 
mation is expressed as an equality between two differentials in the theory of elliptic 
functions [17], [37]. The importance of Landen’s transformation is conveyed by 
Mittag-Leffler who, in his very perceptive survey [37, p. 291] on the theory of elliptic 
functions, remarks, “Euler’s addition theorem and the transformation theorem of 
Landen and Lagrange were the two fundamental ideas of which the theory of 
elliptic functions was in possession when this theory was brought up for renewed 
consideration by Legendre in 1786.” 

In Section 4, we shall prove the following theorem, which is often called Landen’s 
transformation for complete elliptic integrals of the first kind. 


K(x) = K(0) = 


THEOREM 2. If 0 < x < 1, then 


2vx 


1+x 


| = (1+ x)K(x). 


In fact, Theorem 2 is the special case a = 7, 8 = 7/2 of the following more 
general formula. If x sina = sin(28 — «), then 
a 4x ee 
(1 +x) f (1~ sin) “ay =2f*[1 - : | dg, 
0 0 


(1+ x) 
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which is known as Landen’s transformation for incomplete elliptic integrals of the 
first kind. 
To describe another form of Landen’s transformation, we introduce Gauss’s 
ordinary hypergeometric series 
© (a),(b 
F(a, b; C3 x) = x ee 
ag VEDERE 
where a, b, and c¢ denote arbitrary complex numbers and (a), is defined by (7). 
Then 


|x| <1, (12) 


1 1 
(0,26 = (14 a)"F[a,0-b4 550+ 5527] (13) 


4x 
(1 +x)’ 2 
is Landen’s transformation for hypergeometric series. Theorems 1 and 2 imply the 
special case 


11 / 4x 
2°2’°(1+xy 


Thus, a seemingly innocent “change of variable” (10) has many important 
ramifications. Indeed, Landen himself evidently never realized the importance of his 
idea. 

Since Landen undoubtedly is not known to most readers, it seems appropriate 
here to give a brief biography. He was born in 1719. According to the Encyclopedia 
Britannica [20], “He lived a very retired life, and saw little or nothing of society; 
when he did mingle in it, his dogmatism and pugnacity caused him to be generally 
shunned.” In 1762, he was appointed as the land-agent to the Earl Fitzwilliam, a 
post he held until two years before his death in 1790. 

As a mathematician, Landen was primarily an analyst and geometer. Most of his 
important works were published in the latter part of his career. These include the 
aforementioned papers and Mathematical Memoirs, published in 1780 and 1789. 
For several years, Landen contributed many problems and solutions to the Ladies 
Diary. From 1743-1749, he posed a total of eleven problems and published thirteen 
solutions to problems. However, Leybourn [34] has disclosed that contributors to 
the Ladies Diary frequently employed aliases. In particular, Landen used the 
pseudonyms Sir Stately Stiff, Peter Walton, Waltoniensis, C. Bumpkin, and Peter 
Puzzlem, who, collectively, proposed ten problems and answered seventeen. 
Leybourn [34] has compiled in four volumes the problems and solutions from the 
Ladies Diary from 1704-1816. Especially valuable are his indices of subject classifi- 
cations and contributors. (The problems and solutions from the years 1704-1760 
had been previously collected by others in one volume in 1774 [50}.) 

First published in 1704, the annual Ladies Diary evidently was very popular in 
England with a yearly circulation of several thousand. The Ladies Diary is “de- 
signed principally for the amusement and instruction of the fair sex.” It contains 
“new improvements in arts and sciences, and many entertaining particulars... for 
the use and diversion of the fair sex.” The cover is graced by a poem dedicated to 
the reigning queen and which normally changed little from year to year. Our paper 
begins with the poem from 1776 paying eloquent homage to the beloved of King 
George III. Among other things, the Ladies Diary contains a “chronology of 


1 ce : 1; x? 
=(1+ st certs : 
( a) Gas =) 
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remarkable events,” birth dates of the royal family, enigmas, and answers to 
enigmas from the previous year. The enigmas as well as the answers were normally 
set to verse. 

The largest portion of the Ladies Diary is devoted to the solutions of mathemati- 
cal problems posed in the previous issue. Despite the name of the journal, very few 
contributors were women. Leybourn’s [34] index lists a total of 913 contributors of 
which 32 were women. Because proposers and solvers did occasionally employ pen 
names such as Plus Minus, Mathematicus, Amicus, Archimedes, Diophantoides, and 
the aforementioned aliases for Landen, it is possible that the number of female 
contributors is slightly higher. In 1747, Landen gave a solution to a problem which 
was “designed to improve gunnery of which there are several things wanting.” Does 
not this have a familiar ring today? Geometrical problems were popular, and rigor 
was lax at times. Here is an example from 1783. Let 


-Yoey amd b- bee. 


Show that a/b = y2 — 1. In 1784, Joseph French provided the following “elegant” 
solution. We see that 


avae-Le +b. 


k=1 


Thus, b(y2 — 1) = a, and the result follows. 

Those readers wishing to learn more about Landen’s work should consult 
Watson’s very delightful article, “The Marquis and the Land-agent” [52]. Readers 
desiring more knowledge of the mathematical content of the Ladies Diary should 
definitely consult Leybourn’s compendium [34]. (Only a few libraries in the U.S. 
possess copies of the Ladies Diary. The University of Illinois Library has a fairly 
complete collection, although there are several gaps prior to 1774. T. Perl [43] has 
written a detailed description of the Ladies Diary with an emphasis on the 
contributions by women and an analysis of both the positive and negative sociologi- 
cal factors on womens’ mathematical education during the years of the Diary. For 
additional historical information about the Ladies Diary and other obscure English 
journals containing mathematics, see Archibald’s paper [2].) 


4, Ivory and Landen’s transformation. In 1796, J. Ivory [25] published a new 
formula for the perimeter of an ellipse. A very similar proof establishes Theorem 2, 
a version of Landen’s transformation discussed in the previous section. 

Before proving Theorem 2, we note that it implies a new version of Theorem 1. 


THEOREM 1”. If x > 0, then 
a(1 +x) 


2Vx | 


M(1+x,1-—x)= 


2K 
1+x 


Theorem 1’ also follows from Theorem 1”; put x = (a — b)/(a + 5) and uti- 
lize (4). 


1988] 
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Proof of Theorem 2. Using the definition (2) of K, employing the binomial series, 
and inverting the order of summation and integration below, we find that 


_ 


2vx 


1+x 


| 


ee 4x ue 
—f 1 - — > sir d 
2 i | (l+xy ° r 
7 2x vee 
f b — (1 - cos(29) dy 


(1 + x) 


(1 + xf" + x? + 2x cos(2)) dp 
0 


NJP Nie NIP 


(1+ x) fa + xerv) (1 + xe~2iv 0? dy 
0 


pe 7 FN 


0 


ke . 
e2i(m—nje dy 


(1+x)K(x), 


by (8). This concludes the proof. 


Ivory’s paper [25], establishing an analogue of Theorem 2, possesses an unusual 
feature in that it begins with the “cover letter” that Ivory sent to the editor John 
Playfair when he submitted his paper! In this letter, Ivory informs Playfair about 
what led him to his discovery. Evidently then, the editor deemed it fair play to 
publish Ivory’s letter as a preamble to his paper. The letter reads as follows. 


Dear 


Sir, 


Having, as you know, bestowed a good deal of time and attention on the study of that part of 
physical astronomy which relates to the mutual disturbances of the planets, I have, naturally, been 
led to consider the various methods of resolving the formula (a? + b? — 2ab cos ~)" into infinite 
series of the form A + Bcosg + Ccos2g + &c. In the course of these investigations, a series for 
the rectification of the ellipsis occurred to me, remarkable for its simplicity, as well as its rapid 
convergency. As I believe it to be new, I send it to you, inclosed, together with some remarks on 
the evolution of the formula just mentioned, which if you think proper, you may submit to the 
consideration of the Royal Society, 


I am, Dear Sir, 
Yours, & c. 
James Ivory 


5. Calculation of «. First, we define the complete elliptic integral of the second 


kind, 


E(x) = fre - x2sintp) dg, 


where |x] < 1. Two formulas relating the elliptic integrals E(x) and K(x) are the 
basis for one of the currently most efficient methods to calculate 7. The first is due 
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to Legendre [33, p. 61]. We give below a simple proof that appears not to have been, 
heretofore, given. 


THEOREM 3. Let x’ = ¥1 — x”, where 0 < x <1. Then 
7 
K(x)E(x’) + K(x’) E(x) -— K(x) K(x’) = a (14) 


Proof. Let c = x? and c’ = 1 — c. A straightforward calculation gives 


d d pa/2 c sin’ 
= (f= K)= dp 
de de i (1 - csin’p)'”” 

= E 1 a /2 dg 

~ 2e me, (1 ee sin’p) ” ; 


Since 


1 
= aC — csin’p)'” — =A — csin’p) ad 


d sin @ cos p 
dg ( 


i csin’p)'”” 
we deduce that 
d E E 1 par ad sin 9 cos p 
gi a5. “sa oal, ae Gane 
c sin’ (15) 


2c 2cc’ 2c’ tno dp 
E 1 E 
- xl - Z| mare 
For brevity, put K’ = K(c’) and E’ = E(c’). Since c’ = 1 — ¢, it follows that 
qd, : E’ 
aE a Ba (16) 
Lastly, easy calculations yield 
dE E-K dE’ E’— K’ 
de te. Oe ag a 


If Z denotes the left side of (14), we may write Z in the form 
L= EE'-(E-K)(E'-— K’). 
Employing (15)—(17), we find that 


db _(E-K)E’ E(E’-K’) , FE KY). (B=K)E" | 


Hence, L is a constant, and we will find its value by letting ¢ approach 0. 
First, 


a /2 sin’ 
E-K=-e —_—_—_§,, dp = O(c 
i (1 — csin’p)'” 
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as c tends to 0. Next, 
or f2. _ , = 
K'=f ‘ (1 — e’sin’p) ss dp< f “a - ce)? dy 
0 0 


= O(c”), 
as c tends to 0. Thus, 
lim L = lim {(E — K)K’ + EK} 
= lim { O(c!) +1- >} = 2. 
c0 2 2 


and the proof is complete. 
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The second key formula, given in Theorem 4 below, can be proved via an 
iterative process involving Landen’s transformation. We forego a proof here; a 


proof may be found, for example, in King’s book [29, pp. 7, 8]. 


THEOREM 4. Let, fora > b> 0, 
J(a,b) = [°? (a?eos’e + b’sin’p)” dq, 
0 

and recall that c,, is defined by (1). Then 

1 2 
J(a,b) = " --~¥ relia b), 

2 n=0 

where I(a, b) is defined by (3). 


Note that 
J(a, b) = aE(x), 


where x = (1/a)V¥a? — b?. 


(18) 


Theorems 3 and 4 now lead to a formula for 7 which is highly suitable for 


computation. 
THEOREM 5. If c,, is defined by (1), then 


4M*(1,1/y2) 
T= SO 
res Ne Saree 


n=1 


Proof. Letting x = x’ = 1/ ¥2 in Theorem 3, we find that 


2K : E : K> : ut 
v2 \ v2 v2} 2° 
Setting a = 1 and b= 1/2 in Theorem 4, we see that 


a)-t-3Ea 


(19) 


(20) 
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since I(1, V2) = K(1/y2) and J(1,1/ ¥2) = E(1/ y2). Lastly, by Theorem 1’, 


T 
M(1,1/¥2 ) = —-— = .- 21 
Substituting (20) into (19), employing (21), noting that c} = 1/2, and solving for 7, 
we complete the proof. 


According to King [29, pp. 8, 9, 12], an equivalent form of Theorem 5 was 
established by Gauss. Observe that in the proof of Theorem 5, we used only the 
special case x = x’ = 1/2 of Legendre’s identity, Theorem 3. We would like to 
show now that this special case is agian to the formula 


{aoalt a 7 (2) 


first proved by Euler [22] in 1782. (Watson [52, p. 12] claimed that an equivalent 
formulation of (22) was earlier established by both Landen and Wallis, but we have 
been unable to verify this.) The fom integral in (22) is one quarter of the arc 
length of the lemniscate given by r? = cos(2p), 0 < » < 2a. The latter integral in 
(22) is intimately connected with the classical elastic curve. For a further elaboration 
of the connections of these two curves with the arithmetic-geometric mean, see 
Cox’s paper [18]. 

In order to prove (22), make the substitution x = cos g. Then straightforward 
calculations yield 


saa ear 


and 


lar) -a(el- 


It is now easy to see that Legendre’s relation Theorem 3 in the case x = x’ = 1/ ¥2 
implies (22). 

In 1976, Salamin [47] rederived the forgotten Theorem 5, from which he estab- 
lished a rapidly convergent algorithm for the computation of 7. Recall that in 
Section 2 we demonstrated how rapidly the arithmetic-geometric mean converges 
and thus how fast c, tends to 0. Tamura and Kanada have used this algorithm to 
compute 7 to 24 (over 16 million) decimal places. An announcement about their 
calculation of 7 to 2” decimal places was made in Scientific American [48]. Their 
paper [27] describes their calculation to 10,013,395 decimal places. More recently, 
D. H. Bailey [3] has used a quartically convergent algorithm to calculate 7 to 
29,360,000 digits. 

Newman [40] has obtained a quadratic algorithm for the computation of 7 that is 
somewhat simpler than Salamin’s. His proof is quite elementary and avoids 
Legendre’s identity. It should be remarked that Newman’s estimates of some 
integrals are not quite correct. However, the final result (middle of p. 209) is correct. 

In 1977, Brent [14] observed that the arithmetic-geometric mean could be 
implemented to calculate elementary functions as well. Let us briefly indicate how 
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to calculate log 2. From Whittaker and Watson’s text [53, p. 522], as x tends to 
1— : 


4 
K(x) > a meer 


From Theorem 1’, 


K(x) = 


7 
2M(1,¥1— x?)’ 
and so 


WT 


4 
log ————> = ; 
evi x? 2M(1, V1 — x?) 
Taking ¥1 — x? = 4-27", we find that 


7 
1082 = SaM(L2=*)’ 
for large n. 

Further improvements in both the calculation of 7 and elementary functions 
have been made by J. M. and P. B. Borwein [8], [9], [10], [11], [12], [13]. In 
particular, in [8], [11], and [12], they have utilized elliptic integrals and modular 
equations to obtain algorithms of higher order convergence to approximate 7. The 
survey article [10] by the Borwein brothers is to be especially recommended. Carlson 
[16] has written an earlier survey on algorithms dependent on the arithmetic-geo- 
metric mean and variants thereof. 


Postscript to «. The challenge of approximating and calculating + has been with 
us for over 4000 years. By 1844, + was known to 200 decimal places. This 
stupendous feat was accomplished by a calculating prodigy named Johann Dase in 
less than two months. On Gauss’s recommendation, the Hamburg Academy of 
Sciences hired Dase to compute the factors of all integers between 7,000,000 and 
10,000,000. Thus, our ideas have come to a full circle. As Beckmann [5, p. 104] 
remarks, “It would thus appear that Carl Friedrich Gauss, who holds so many firsts 
in all branches of mathematics, was also the first to introduce payment for computer 
time.” The computer time now for 29 million digits (28 hours) is considerably less 
than the computer time for 200 digits by Gauss’s computer, Dase. 


6. Approximations for the perimeter Z of an ellipse. If an ellipse is given by the 
parametric equations x = acos@ and y = bsing, 0 < » < 27, then from elemen- 
tary calculus, 


Qn ; 1/2 
L=L(a,b) = a*sin’p + b*cos’y) “” dp 
(a,) = ) - 
= 4J(b, a), 


where J(b, a) is defined by (18). Thus, we see immediately from Theorems 1’ and 4 
that elliptical perimeters and arithmetic-geometric means are inextricably inter- 
twined. Ivory’s letter and our concomitant comments also unmistakenly pointed to 
this union. 
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Before discussing approximations for L, we offer two exact formulas. The former 
is due to MacLaurin [36] in 1742, and the latter was initially found by Ivory [25] in 
1796, although it is implicit in the earlier work of Landen. 


THEOREM 6. Let x =acosqy and y=bsing, 0< gy <2a. Let e= 
(1/a)V¥a? — b?, the eccentricity of the ellipse. Then if F is defined by (12), 


L(a,b) = 2ear(5, — 5515 ¢ (24) 
2 2 
= wa + o)F(~5 - 5548] (25) 
7 ay eae! b 
where 
a—b 
aS a+b 


Proof. The proofs are very similar to those in Sections 2 and 4. First, using (23), 
expanding the integrand in a binomial series, and integrating termwise, we deduce 
that 


L(a,b) = 4a f""(1 - e*cos*p)'” dep 
0 


(26) 


Thus, (24) is established. 


We indicate two proofs of (25). First, in Landen’s transformation (13) of 
hypergeometric series, set a = —1/2, b= 1/2, and x = A. We immediately find 


that 
Gl ea eee ee ad Bens 
(- 5.53 2) = 2a (-5.- 53 > . 
By this formula and (24), formula (25) is demonstrated. 
The second proof that we mention is that of Ivory [25]. Using (26), proceed 
exactly in the same fashion as in the proof of Theorem 2 in Section 4. 


In fact, there exists a third early formula for L(a, b). In 1773, Euler [21] proved 
that 
i 2 


L(a, b) = mj2(a? + b?) revise 


a? —b? 
7H 


Although Euler proceeded differently, we mention that his formula may be derived 
from MacLaurin’s via a certain quadratic transformation for hypergeometric series 
that is different from Landen’s. Euler’s formula also trivially leads to an approxima- 
tion for L(a, b) given in our table below. 
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The problem of determining L(a, b) is not as venerable as that for determining 
a. However, some have argued (not very convincingly) that the problem goes back 
to the time of King Solomon, who hired a craftsman Huram to make a tank. 
According to 1 Kings 7 : 23, “Huram made a round tank of bronze 5 cubits deep, 10 
cubits in diameter, and 30 cubits in circumference.” The implication is clear that the 
ancient Hebrews regarded m as being equal to 3. It has been suggested, perhaps by 
someone who believes that “God makes no mistakes,” that “round” and “depth” 
are to be interpreted loosely, and that the tank really was elliptical in shape, with 
the major axis being 10 cubits and the minor axis being about 9.53 cubits in length. 

As might be expected. the primary impetus in finding methods for calculating 
elliptical perimeters arises from astronomy. In 1609, Kepler [28] offered perhaps the 
first legitimate approximations 


=m(a+b) and L = 2m/ab, 


although, as we shall see, his arguments were not very rigorous and 2nvab was 
intended to be only a lower bound for L. Kepler [28, p. 307] first remarks that the 
ellipse with semiaxes a and b and the circle with radius Yab have the same areas. 
Since the circle has the smaller circumference, 


L > 2avab. 


He (28, p. 368] furthermore remarks that (1/2)(a + b) > Vab, and so concludes 
that 


1 
Lz 2n5(a + b). 


Kepler appears to be using the dubious principle that quantities larger than the 
same number must be about equal. 

Approximations of several types, depending upon the relative sizes of a and 5, 
exist in the literature. In this section, we concentrate on estimates that are best for a 
close to 6. Thus, we shall write all of our approximations in terms of A = 
(a — b)/(a + b) and compare them with the expansion (25). For example, Kepler’s 
second approximation can be written in the form 


Lxa(a+o)(1 -»¥)'”. 


We now show how the formula 
2a 12 
: = ; ot Sa oe eee Qn 2 : 2 
L(a, b) = 43 (a, b) M(a,b) [< 7» a] (27) 


arising from Theorems 1’ and 4, can be used to find approximations to the 
perimeter of an ellipse. Replacing M(a, b) by a, and neglecting the terms with 
n > 2, we find that 


Kae 2m{, o  ,\_ 2maz 5 a+b \ 
are Ge eae 


This formula was first obtained by Ekwall [19] in 1973 as a consequence of a 
formula by Sipos from 1792 [54]. 
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If we replace M(a, 6) by a, in (27) and neglect all terms with n > 3, we find, 
after some calculation, that 


2(a +b)? — (Va — vb)" 
(Va + vb) + 2¥2 Va +b vob 


This formula is complicated enough to dissuade us from calculating further ap- 
proximations by this method. 

We now provide a table of approximations for L(a, b) that have been given in 
the literature. At the left, we list the discoverer (or source) and year of discovery (if 
known). The approximation A(A) for L(a, b)/m(a + b) is given in the second 
column in two forms. In the last column, the first nonzero term in the power series 
for 


L(a,b) = 22 


L(a, b) 
AN) ea) 


is offered so that the accuracy of the approximating formula can be discerned. For 
convenience, we note that 


A(Ar F : Mase 
—_ (A) | 2? Hr? | 


2¥ab 
Kepler [28], 1 = —-%) 
epler [28], 1609 ok qa —- 2?) ; 
y2(a? + b? 1 
Euler {21}, 1773 Ucn ae = (1+ X22 “x 
a+b 4 
Sipos [54], 1792 2(a +b) _ 2 
Ekwall [19], 1973 (Va fe vb) Lt vil _- 


3. ab 3 41 3 
Peano [42], 1889 ~ =---(-x¥)7 —xé 
2 a+b 2 2 64 


2 ail? + p/2 2/3 
Muir [38], 1883 | 


at ( 2 
1 
7 pala +r/7724+ (1 -Apeype 


ie ‘ lja-—b\? ; 
indner [35, p. 439], + 3 at 7) 1 
1904-1920 


1 2 
Nyvoll [41], 1978 - (1 “ 3° 


4(a- by 


49}, 197 pageant ete eee 
ad aca Saal a (5a + 3b)(3a + 5b) 
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3 -— ¥(at 3b)(3a tb) 
a+b 
=3-y4-¥ 
; 2(a + bY’ - (Va - vb) 
(a+5){ (Va + Vb) + Viva +b ab } 
(i+vi-¥) +x -# 


(+vio8)(14 Yi) 


Bronshtein and 1 64(a+ y* as by! 
Semendyayev Ea PER ee 
[15], 1964 16 (a + b)°(3a + b)(a + 3b) 


64 - 30 


Ramanujan [44], [45], 1914 
Fergestad [49], 1951 


Almkvist [1], 1978 


Selmer [49], 1975 


Selmer [49], 1975 


2y2(a? + 6ab +b?) 


a+b 


Jacobsen and 256 — 4802 — 2104 
Waadeland [26], 1985 256 ~ 1122 + 34 


Ramanujan 


[44], [45], 1914 10+ V4 — 3% 


The two approximations by India’s great mathematician, S. Ramanujan, were 
first stated by him in his notebooks [46, p. 217], and then later at the end of his 
paper [44], [45, p. 39], where he says that they were discovered empirically. 
Ramanujan [44], [45] also provides error approximations, but they are in a form 
different from that given here. Since 


oD. Lay hae e? 
ath 14+Vi-e 4’ 


we find that, for the first approximation, 


< 20a—- = ra= 


rn 
m(a + b)=5 = na(l + yl -e?) aa 5° 


which is the approximate error given by Ramanujan. Similarly, for the second 
approximation, Ramanujan states that the error is approximately equal to 


(e2/4)° giz al 
2? 


e 20 


30a 336 . 
which is in agreement with our claim. The exactness of Ramanujan’s second formula 
for eccentricities that are not too large is very good. For example, for the orbit of 
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Mercury (e = 0.206), the absolute error is about 1.5 x 10~!° meters. Note that if 
we set b = 0 in Ramanujan’s second formula, we find that 7 ~ 22/7. 
Fergestad [49] rediscovered Ramanujan’s first formula several years later. 
Despite Ramanujan’s remark on the discovery of these two formulas, Jacobsen 
and Waadeland [26] have offered a very plausible explanation of Ramanujan’s 
approximations. We confine our attention to the latter approximation, since the 
arguments are similar. Write 


F : : 1; 1 . 28 
—~,-=31;] =1+ ——_.. 
| ae a 4(1 + w) 8) 
Then it can be shown that w has the continued fraction expansion 

3 3 1 


If each numerator above is replaced by —3A”/16, then we obtain the approximation 
1 
w= 5(-24 va-3e). 


Substituting this approximation in (28) and then using (25), we are immediately led 
to the estimate 
L(a, b) ; 3N 
——_ = 1+ — =. 
m(a+b) 10 + ¥4 - 3¥ 
Since Ramanujan’s facility in representing analytic functions by continued fractions 
is unmatched in mathematical history, it seems likely that Ramanujan discovered his 
approximations in this manner. 
In the next section, we examine some approximations for L(a, b) of a different 
type given by Ramanujan in his notebooks [46]. 
7. Further approximations given by Ramanujan. In his notebooks [46], Ramanu- 
jan offers some very unusual formulas, expressed in sexagesimal notation, for 
L(a, 6). The first is related to his approximation 3 — ¥4 — 2’ given in Section 6. 


THEOREM 7. Put 


1 
(a,b) = (a +6){1 + Asin? 58), 0<6<7/4, (29) 
and 
in? = dX si alt 30 
sin@ = A sina, meray (30) 


Then, when the eccentricity e = 1, a = 30°18'6”, and as e tends to 0, a tends 
monotonically to 30°. 


It is not clear how Ramanujan was led to this very unusual theorem. The variance 
of a over such a small interval is curious. 


Proof. We shall prove Theorem 7 except for the conclusion about monotonicity. 
However, we shall show that a > 7/6 always. 
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For brevity, we write (25) in the form 
L(a, b) o° 
oe ", A| <1. 31 
aay [Al (31) 
It then follows from (29) and (30) that 
1 e) 
3 — 2¥1 — Msin’'a = 1 + 4sin’0 = Yar", |A| <1. (32) 
n=0 
Next set 


3-vy4-N = EBX, |Al <2. (33) 
n=0 


As implied in Section 6, a,, = n= 0,1,2. We shall further show that, for n > 3, 
B, < o,/2"~?. (34) 


From the definitions (31) and (33), respectively, short calculations show that, for 
n>, 


ne 


G41 _ (2n—1)° Busy 2n-1 
= ——— and = ——.. 
a, (2n + 2) B, 8(n + 1) 
Thus, 
By a1 On+1 n+] 1 


=> COU i 
B, | a,  2(2n-1) ~ 2’ 
if n > 2. Proceeding by induction, we deduce that 
B41 1 B, 1 
<x Saat 


—<«< 
— s n-1? 
+1 2 a, 2 


a, 


for n > 2, and the proof of (34) is complete. 
From (32) and (34), it follows that 


3-—y¥4- <3-2y1 —-Wsin’a. 


Solving this inequality, we find that sin’a > 1/4, or a > 7/6. 
Second, we calculate a when e = 1. Thus, A = 1 and @ = a. Therefore, from (25) 
and (32), 
1 4 dain? F : raed i (35) 
sin 5a = Fl-5,- 53151) = —. 
This evaluation follows from a general theorem of Gauss on the evaluation of 
hypergeometric series at the argument 1 [4, p. 2]. Moreover, this particular series is 
found in Gauss’s diary under the date June, 1798 [24]. Thus, 


1 


1 1 
sin’ —a = — — — = 0.0683098861. 
2 7 4 


It follows that a = 30°18’6”. 
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Third, we calculate a when e = 0. From (30) and (32), 


1 
sin’6 4sin’ 58 
lim sin’?a = lim = in —— 
0 ee avo Mw 
90 1 
= limA7* ¥ aX"=a,= -. 
A=+0 La, tg 


nw] 
Thus, a tends to 1/6 as e tends to 0. 

Ramanujan [46, p. 224] offers another theorem, which we do not state, like 
Theorem 7 but which appears to be motivated by his second approximation for 
L(a, 6). 

Ramanujan [46, p. 224] states two additional formulas each of which combines 
two approximations, one for e near 0 and the other for e close to 1. Again, we give 
just one of the pair. A complete proof of Theorem 8 below would be too lengthy for 
this paper, and so we shall just sketch the main ideas of the proof. Complete details 
may be found in [7]. 


THEOREM 8. Set 


tan 6 
L{a,b)=a(at b> 0<0<7/2, (36) 
and 
6=2 ie 37 
tan @ = Acos «, genres (37) 


Then as e increases from 0 to 1, a decreases from 2/6 to 0. Furthermore, a is 
approximately given by 


2Vab (va ~ yb) a— by? 
30° + 6°18'49” ———_——— — 1°10’55’”’| —— | }. 
ae at+b o> teal, (38) 


Proof. If ¢ = 0, then X = 0 and @ = 0. The argument is very similar to that in 
the proof of Theorem 7, and we find that 


lim cos*a = 3a, = 3/4. 
A430 
Thus, a = 7/6 when A =0 =e. 
We next determine a when e = 1. Thus, A = 1 and tan @ = cos a by (37). From 


(25), (36), and (35), 


tan 6 
6 


A=1 


Thus, 6 = 7/4 and a = 0. 

It appears to be extremely difficult to show that as A goes from 0 to 1, @ 
monotonically decreases from 7/6 to 0. It can be shown [7], however, that 0 < a < 
2/6, always. A proof depends upon a continued fraction for tan~ ‘x. 


8. GAUSS, LANDEN, RAMANUJAN, AND THE LADIES DIARY (1988) 147 


1988] GERT ALMKVIST AND BRUCE BERNDT 605 


The proof of (38) is very difficult, and we provide only a brief sketch. We observe 
(again) that 


— 2yab 
a acer 
and so 
2 dy AAP _ Jpy 
ee ato pia — 1)" 


Thus, Ramanujan is attempting to find an approximation to a of the form 
V1-¥ (44+ B{1-V1-} +c), (39) 


which will be a good approximation both when ) is close to 0 and when dD is near 1. 
Our task is then to determine A, B, and C. 
With a considerable amount of effort, it can be shown that [7] 
a 21y3 


Ss a, ey 4 
Hea k POM) (40) 


in a neighborhood of A = 0. The proper expansion near A = 1 is even more difficult 
to obtain because F(— oe - ie 13 ’) is not analytic at A = 1. However, there does 
exist an asymptotic expansion for F(— 4+, — 4;1; 4”) as A tends to 1—, and 
employing this, we can show that [7] 


4-— a 
a= | i 1-¥ + o0(y1-2’), (41) 
20 —4 
as A tends to1 —. 


Having omitted the hard analysis, we now determine A, B, and C from (40) and 
(41) with little difficulty. When A tends to 0, (39) tends to A. Thus, A = 7/6, by 
(40). Next, examine (a — 7/6)/d as A tends to 0. From (39) and (40), we find that 


T ae 21y3 
-y+7B+C= -—. 
D2 160 


Now check a/ ¥1 — 2? as A tends to 1 — . From (39) and (41), we see that 


T+ BtC V snd 
6 VW 4° 


Simultaneously solving these last two equalities, we conclude that 


4—-a 21y3  @ 
B=2 mai. wm 
and 
Ci 4- a 2173 
C= 3 | 80 = —0.0206291. 


Converting A, B, and C to the sexagesimal system and substituting in (39), we 
complete the proof. 
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Although Ramanujan is well known for his approximations and asymptotic 
formulas in number theory, he has not been adequately recognized for his deep 
contributions to approximations and asymptotic series in analysis, because the vast 
majority of his results in the latter field have been hidden in his notebooks. These 
notebooks were begun in about 1903, when he was 15 or 16, and are a compilation 
of his mathematical discoveries without proofs. The last entries were made in 1914, 
when he sailed to England at the urging of G. H. Hardy. Although the editing of 
Ramanujan’s notebooks was strongly advocated by Hardy and others immediately 
after Ramanujan’s death in 1920, it is only recently that this has come to fruition 
[6]. 

We have not attempted to give complete proofs of some of the theorems that we 
have described, but we hope that the principal ideas have been made clear. We have 
seen that a chain of related ideas stretches back over a period exceeding two 
centuries and provides impetus to contemporary mathematics. Ideas and topics that 
appear disparate are found to have common roots and merge together. For further 
elaboration of these ideas, readers should consult the works cited, especially Cox’s 
paper [18], the papers and book of J. M. and P. B. Borwein [8]-[13], a paper by 
Almkvist [1} written in Swedish, and Berndt’s forthcoming book [7]. 

We are most grateful to Birger Ekwall for providing some very useful references. 
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Synopsis: 

In this paper, Yasumasa Kanada describes the computation of 7 to over 200 
million decimal digits on a Hitachi S-820 vector supercomputer in Japan. Kanada 
employed what he termed the Gauss-Legendre formula, which is very similar to the 
formulas found by Salamin and Brent, and, for a check, by using Borwein quartic 
algorithm (the same algorithm earlier employed by Bailey). 

Kanada provides very detailed information on exactly how he implemented 
these formulas, how he performed high-precision multiplication using a fast Fourier 
transform, and, how he accelerated his code to run at the maximum possible per- 
formance on the Hitachi supercomputer. Kanada also provides an interesting sta- 
tistical analysis on these 200 million digits. 
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Vectorization of Multiple-Precision Arithmetic Program 
and 
201,326,000 Decimal Digits of x Calculation 


Yasumasa Kanada 


Computer Centre, University of Tokyo, 
Bunkyo-ku Yayoi 2-11-16, Tokyo 113, Japan 


More than 200 million decimal places of x were calculated using arithmetic-geometric mean formula 
independently discovered by Salamin and Brent in 1976. Correcmess of the calculation were verified 
through Borwein’s quartic convergent formula developed in 1983. The computation took CPU times of 5 
hours 57 minutes for the main calculation and 7 hours 30 minutes for the verification calculation on the 
HITAC S-820 model 80 supercomputer with 256 Mb of main memory and 3 Gb of high speed semicon- 
ductor storage, Extended Storage, for shorten I/O time. 


Computation was completed in 27th of January 1988. At that day two programs generated values up 
to 3x26, about 201 million. The two results agreed except for the last 21 digits. These results also agree 
with the 133,554,000 places calculation of m which was done by the author in January 1987. Compare to 
the record in 1987, 50% more decimal digits were calculated with about 1/6 of CPU time. 


Computation was performed with real arithmetic based vectorized Fast Fourier Transform (FFT) 
multiplier and newly vectorized multiple-precision add, subtract and (single word) constant multiplication 
programs. Vectorizations for the later cases were realized through first order linear recurrence vector 
instruction on the §-820. Details of the computation and statistical tests on the first 200 million digits of 
n-3 are reported. 


1, Introduction 


Since the epoch-making calculation of x to 100,000 decimals [17], several computations have been 
performed as in Table 1. The development of new algorithms and programs suited to the calculation of m 
and new high speed computers with large memory and high speed large semiconductor disk, Extended 
Storage or Solid State Disk, threw more light on this fascinating number. 


There are many arctangent relations for [9]. However, all these computations until 1981 and 
verification for 10,000,000 decimal calculation of our previous record[21] used arctangent formulae such 
as: 


1 1 
= —-4 — . 
m= 1l6arctan 5 arctan 739° Machin 
= SMorcan= + Rake + distin Stormer 
8 57 239 ” 
1 1 1 
= 48 a eek a 
arctan 18 + 32arctan 37 20arctan 739° Gauss 
1 1 1 
= — -4, ee as : . * 
32arctan 10 arctan 739 l6arctan-— Klingenstierna 


In 1976, an innovative quadratic convergent formula for the calculation of m was published indepen- 
dently by Salamin [14] and Brent [5]. Later in 1983, quadratic, cubic, quadruple and septet convergent 
product expansion for m, which are competitive with Salamin’s and Brent’s formula, were also discovered. 
by two of Borwein[2]. These new formulae are based on the arithmetic-geometric mean, a process whose 
rapid convergence doubles, triples, quadruples and septates the number of significant digits at each step. 
The arithmetic-geometric mean is the basis of Gauss’ method for the calculation of elliptic integrals. With 
the help of the elliptic integral relation of Legendre, m can be expressed in terms of the arithmetic- 
geometric mean and the resulting algorithm retains quadratic, cubic, quartic and septic convergence of the 
arithmetic-geometric mean process. 
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The author and Mr. Y. Tamura have calculated m up to mare than 200 million decimal places by 
using the formula of Brent and Salamin and verified through Borwein’s quartic convergent algorithm for x. 
Even for the quartic convergent algorithm, quadratic convergent algorithm of Brent and Salamin is faster 
through actual FORTRAN programs. 


For reducing the computing time, theoretically fast multiple-precision multiplication algorithm was 
implemented through normal fast Fourier transform (FFA), inverse fast Fourier transform (FFS) and convo- 
lution operations[{11] as before[19,9,21,8]. And in order to get more speed than before, vectorization 
schemes to the multiple-precision add, subtract and (single word) constant multiplication were introduced 
for the first time. 

Calculation of 200 million decimal places of m was completed in January 27, 1988 and needed 5 
hours 57 minutes of CPU time on a HITAC S-820 model 80 supercomputer at Hitachi Kanagawa Works 
under a VOS3/HAP/ES 31 bit addressing operating system. Main memory used was about 240 Mb and 2.7 
Gb of Extended Storage was also required. If the machine had more memory, CPU time could be reduced 
and more Extended Storage, calculation decimals could be extended with minor changes in the FORTRAN 
programs. 

The algorithms used in the 200 million place calculation are briefly explained in section 2. Program- 
ming in FORTRAN is discussed in section 3. Results of statistical tests and some interesting figures for 
the 200,000,000 decimals of x appear in section 4. 


2. How to Calculate x: Algorithmic Aspects 
In this section we briefly explain the algorithms used in the 200 million decimal place calculations. 


2.1. The Gauss-Legendre Algorithm; Main Algorithm 


The theoretical basis of the Gauss-Legendre algorithm for x is explained in the references [2,5, 14]. 
Here, we summarize the quadratic algorithm for m. (Refer to the references for the details.) 

We first define the arithmetic-geometric mean agm(ao,bq). Let ao, bo and Co be positive numbers 
satisfying aj = b§ +c}. Define a,, the sequence of arithmetic means, and b, , the sequence of geometric 
means, by 
2. (Gn-1 + 5,-4) 


ay 5 


1 On = Gq-y X B41), 
Also, define a positive number sequence c,: 
ee =a2—-b?. (63) 
Note that, two relations easily follow from these definitions. 
cer (@,4 a b,-1) 
Of: 2 
Then, agm({aq,bp) is the common limit of the sequences a, and b, , namely 
agm(ay,b>) = lima, = limb, . 
ao Ae 


= (Ga-1- Ga), on = 4X Gag X Coat - (2) 


Now, 7 can be expressed as follows: 
spies dagm (Ld agm(1k) : (4) 
1- FCP +c?) 
fA 
where ag =@'g=1, bo =k, bp =k’ and k?+k%=1. It is easier to compute squares of c; and c’; by 
Eq. (2) than to calculate c/ and c’? by Eq. (1). 


The symmetric choice of k = k’ = 2"! is recommendable for the actual calculation and it causes the 
two sequences of arithmetic-geometric means to coincide. Then, Eq. (3) becomes 


ais A(agm (1,2)? ; (4) 


1~ Se? 
jal 


After n square root operations in computing agm = agm(1,2-"”), 1 can be approximated by 7,: 
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4a2 a, 
t= ——_—— =§ ——_"*—_.. (5) 


a Z n , 
1- Yc? ~— 0.25 - He? 
j=l j-l 
Then, the absolute value of  — 1, is bounded as follows (Theorem 2b in reference [14)) : 


1 —T, < (nx2"4/agm) exp(-1x2"*) . (6) 


Thus, the formula has the quadratic convergence nature. We must note here that all operations and con- 
stants in Eq. (4) must be correct up to the required number of digits plus o (20 to 30 as the guard digits for 
200 million decimal digits calculation). 


Then the sequences of agm and agm related x are calculated by the following algorithm, 
As 1;B := 2" T = 1/4, X = 1; 


while A—B>2™ do begin 

W := A*B; V =A; A = A+B: A = Al2; 
V = V-ASV = V"V; V o=V"X; 

T = T-V;B = VW;X = 2*X end; 


A :=A*B;B := VT; A := A*B; 
return A . 


Here, A,B, T, V and W are full-precision variables and X is a double-precision variable. After twenty 
eight iterations of the main loop, x to 200 million decimal places is to be obtained. 


2.2. Borwein’s Quartic Convergent Algorithm: Verification Algorithm 
Borwein’s quartic convergent algorithm is explained as the following scheme: 


@9=6-4x V2, yo= 2-1 


1=G-=y)" 4 5 2k+3 
Yeu = T+d+y)*" Oga1 = Op X (1+ Year)” — 2 x Year X (1 + Year + Yea), 
k 


n= aa for large k. 
a 


Here, precisions for a, and y, must be more than the desired digits. This algorithm is basically the same 
with that used for the main run of the 29 million decimal calculation done by Dr. D.H. Bailey[1]. Dr. Bai- 
ley used the following Borwein’s quadratic convergent algorithm for the verification calculation: 


ay= V2, by =0, po=2+v2 


1 
¢ a + —=) 
_ Nay rs — Ja, x (1 + by) _ Pe X Best X (1 + Gest) 
rr i, a a Pre = 1+ dea 


m=p, for large k. 


Here, precisions for a,, 6, and p, must be more than the desired digits. It is to be noted that the Gauss- 
Legendre algorithm is superior to the Borwein’s algorithms explained here as in the Table 2. 


2.3. Calculation of Reciprocals and Square Roots 


As is easily seen from the algorithms explained above, arithmetic operations, reciprocals and square 
roots must be computed with high efficiency in order to reduce the computing time. Theoretical bases for 
fast calculation of reciprocals and square roots of multi-precision numbers appear in references 
[16,12,4,5]. In this subsection we summarize the algorithms used in the actual calculation. 


The reciprocal of C is obtained by the Newton iteration for the equation f (x) =x —C = 0: 
Kin. = Hi X(2-C XH). (7) 
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A single-, double- or quadruple-precision approximation of 1/C’ is a reasonable selection for xo, i.e. initial 
starting value for the Newton iteration. 

Square roots of C should be calculated through the multiplication of C and the result obtained by the 
Newton iteration for the equation f (x) = x7? - C =0: 
xX B-C xx?) 

5 £ 

In this case also, a single-, double- or quadruple-precision approximation of 1/VC is a reasonable selection 
for Xo, initial starting value. 

Compare this to the well known Newton iteration for the equation f (x) = x? - C = 0: 


(8) 


Aja = 


@ + a 
i= 9) 


The iteration of Eq. (8) is better in both computing complexity and in actual calculation than the iteration 
of Eq. (9). The order of convergence for the iterations of Eq. (7) and Eq. (8) is two. This convergence 
speed is favorable in the actual calculation. It is to be noted that in these iterations the whole operation 
need not to be done with full-precision at each step. That is, if k-precision calculation is done at step j, 
2k -precision calculation is sufficient for at step j+1. 


2.4. Multiple-precision Multiplication 

Schénhage-Strassen’s algorithm [15], which uses the discrete Fourier transform with modulo 2” + 1, 
could be the key multiple-precision multiplication algorithm for speeding up the 7 calculation. However, 
this algorithm is so hard to implement and needs binary to decimal radix conversion for the final result. 
Dr. Bailey also used discrete Fourier transform but with three prime modulo computation followed by the 
reconstruction through Chinese Remainder Theorem for his 29 million decimal places calculation[1]. 

Now, we focused the special scheme which utilizes the fact of "the Fourier transform of a convolu- 
tion product is the ordinary product of the Fourier transforms.” 

Let consider the product C of two length n with radix X integers A and B. Note that the radix X 
need not to be a power of 2. 


dat > 
A= YaX' B= Dx, 
t=O i= 


where O<a;, <X,0¢6,<X for O<i<n-l, 0<a,.,<X,0<6,.,<X and a; =6;,=0, for 
i<O,an Si. 


Then, 
2n~-) ; a-l ss 2a-l 2a-l 2a-1 Z 
Cz=A°B =(¥ aX"): (3 4;X/)= Dd X'S 4;b;-;) = > ix’ 
i=0 jx tz j=d i=0 


Thus, 
2a-l 
ci = E 4b;-;, for i =0, teeee ,2n-2, and Cm) = 0. 
j= 


If w = exp (2ni/2n) is a 2nth root of unity, the one-dimensional Fourier transform of the sequences 
of complex numbers (ao, G1, °"" 4 Gon) (bo, bi; aes » Dont) and {Co Cy "o's £2,~1) are defined as 
the sequences of (49, 41, °°* .@2q,-1), o, 5), °°° » Ban-1) and (€, 6), +--+ . Eons), Tespectively. 
Here, 

2-1 ae oe 2a-1 , 2a-\ . 
4 = Yiao",5,= Yb0",%= Yew, for O<i <a-1. 
i= i= i 
Then, (Gobo, 4,5), Se TSS og abe) is equal to (Co, é1, I C2n-1) as the followings: 
re Qn-1 | ; Qari dari 0 Qael 2nel 2a-1 
4b =( Daw") (Ybo%) =F Yabo = FY (Yab_o* = Heo" 
i=0 j= a) t=O j=0 t= 
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In these discussions, radix X might be an any number. However, @ must be a complex number, 
namely floating point real arithmetic operations are needed in the process of Fourier transformation. If and 
only if all the floating point real arithmetic operations are performed exactly, this scheme should give the 
correct result as discussed in page 290-295 of reference[11]. Compared to the discrete Fourier transform 
based multiplier, floating point real arithmetic operations based Fourier transform seems ideal and 
dangerous, but attractive as for the actual multiple-precision multiplication method. 


The reasons are: 


1) The speed of double precision floating point operations is faster than integer operations in the 
available machine, especially for supercomputers. And in general, the number of bits obtain- 
able in one double precision floating point instruction is longer than that obtainable in one 
single integer instruction. 


2) Conversion from binary results to decimal results needs other techniques and coding. (Simple 
is best. SchOnhage-Strassen’s discrete FFT algorithm is the binary data multiplier.) 

3) A qualified high-speed FFT routine was available as a library. (Qualified programming 
improves the reliability of the program. If we adopt SchSnhage-Strassen’s discrete FFT algo- 
rithm, we have to code for it.) 


4) Chinese remainder based discrete FFT algorithm is not so fast as far as the Dr. Y. Ushiro’s 
experiment in 1983[20] is concemed. (He showed the inferiority of Chinese remainder based 
discrete FFT to us prior to the Dr. Bailey's experiment[1]. We needed a faster multiple- 
precision multiplier.) 


Followings are the algorithm used in our calculation. Here, let consider the multiplication of two 
m X 2" bit (= m x (log,92) x 2” decimal digit) integers A and B through our schemes. 


Step 1: Prepare two 2 x 2* entry double precision floating point array. 


Step 2: Convert both of m x 2" bit integers into double precision floating point numbers. (The 
first half of 2 x 2" entry contains information for m x 2” bit, namely, m bit information 
per one double precision floating point array entry.) 

Step 3: _ Initialize to double precision floating point zero for the second half of 2 x 2" entry. 

Step 4: Apply 2"*! point normal Fourier transform, say FFA, operations to A and B giving A‘ and 
B’, respectively. 

Step 5: Do the convolution product operations between A‘ and B’ giving new 2 X 2" entry double 
precision array C’. 

Step 6: | Apply 2"*) point inverse Fourier transform, say FFS, operations to C’ giving C. (Now, C 
is the double precision floating point array of 2 x 2” entry. If operations FFA, FFS and 
convolution product are performed in infinite precision, each entry of C should be the exact 
double precision floating point representation for integer with maximum value of 
2" x (2"-1)°, However, these representation are slightly deviated from exact integer in the 
actual operation. Because, infinite precision operations are impossible to perform.) 

Step 7: Convert each entry of C (let it to be x) into integer representation. (Conversion should be 
done with IDNINT operation in FORTRAN. (IDNINT(x) = IDINT(x+0.5D0).) If absolute 
value of (c-DFLOAT(IDNINT (x))) is near to 0.5D0, the multiplication is considered to be 
incorrect. We don’t know that criterion of 0.45D0 is sufficient or not. However criterion 
of 0.2D0, for example, was sufficient enough in the actual multiplication.) 


Step 8: Normalize C under the suitable base. The base of 2" or 10%!” is better for binary or 
decimal representation. Final result is the result of multiplication between A and B. 


According to the theoretical and experimental analysis of error in FFT [6,10], FFT attains rather 
stable error behavior. Theoretically roughly speaking, 2' point FFT attains ! log (1) bits error at the max- 
imum (worst case). This means that FFA and the convolution product followed by FFS would acquire 
2 1 log(!) bits error at the maximum, even if the trigonometric functions at coefficients for the butterfly 
operation are calculated exactly. 


Available mantissa bits are 56 for double-precision floating point data of the Japanese supercomput- 
ers. (CRAY and ETA machines adopt 48 bit mantissa representation for the floating point numbers.) Then, 
the following equation must be satisfied for the worst case; 
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bits available in the calculation 2 (guard bits for FFA and FFS operations) + 
(necessary bits for preserving the results) 
56 > 2 (ntl) log (n+l) + log (2"x(2"-1)’) . 
Here a is an integer. Thus, even for m = 1, 
ns7. 


The analysis suggests a maximum n of 7. However, the maximum n depends on the method of program- 
ming, errors in trigonometric function calculations, value of m, etc. (There is another error analysis for the 
floating point real FFT multiplier in reference[{11] which specifies the radix X in our explanation to be the 
power of 2.) 


The error analysis for the actual calculation is hard. Then, we have checked the availability of the 
FFT multiplier based on the above schemes through actual programs. We monitored the deviations from 
integer values after the stage of conversion to integer representation as explained in Step 7 of the above 
explanation. The monitoring secures a maximum n of 24, for the condition of one data point, a double 
word, holds 3 decimals at the maximum (m is about 10) on the first half of 2 x 2” data points and zero on 
the second half of 2 x 2% data points. Then we decided that the maximum length of multiplicand to be 
multiplied in-core is 3 x 2% decimal places (about 50 million decimals) in the actual multiplication. 


For multiplying 3 x 2% decimal places numbers (multiple-precision data for 200 million decimal 
places), we used the classical O(n?) algorithm, e.g. school boy method, for the data of 276/274 =2?=4 
units with base of 24 x 1,000! It is possible to realize multiple-precision multiplier all through FFT, not 
through school boy method. In order to do so, we must write extra program for out-of-core version of 
FFT. We preferred to utilize the reliability of the numerical libraries. 


2.5. Vectorization of Multiple-Precision Add, Subtract and (Single Word) Constant Multiplication 


For the programs of multiple-precision add, subtract and (single word) constant multiplier, time con- 
suming process is releasing the “borrow or carry.” 


If the machine has special instruction for vectorizing first order linear recurrence relation of the fol- 
lowing, the process of releasing the borrow or carry could be vectorized. (Now, all of the Japanese super- 
computers are equipped with such instructions.) 


DO 10T=... 
10 Ad) =B() * Ad-1) + CM 
Here, vector A has a nature of first order linear recurrence relation. The following program which were 
extracted form the source codes of the actual run will explain how to rewrite the program for multiple- 


precision adder. Same strategies can be applied to the multiple-precision routines for subtract and (single 
word) constant multiplication. 


C integer*4 -- MA. MB. IC¥{=carry), [CWt=work), IONE8(=base=10##6) 
C real*8 -- Y¥C=work), Z(=carry), C¥(=work), CWC=work) 
C real*8 -- ONEDS=DFLOATCIONES). QNEDM8=1.D00/0NE08, HLFOM8=.500«GNEDMB 


DG 10 J=NDA,1,-1 DO 10 J=NDA.2,~-1 

ICW=MA (J>+MBCUD+ICY ¥CJ> SMA CJD +MBCJ) 

IF¢ ICW.GE.IONE& > THEN ZCJ-1) =¥ CJ) #ONEDMS+Z (J) *ONEDMS 
MACJ>=ICW-LONES Z¢(J>SDINT(Z CJ) +HLFDM8) 
Icy=1 10 CONTINUE 

ELSE ==> Z(1)=DINT(Z (J) +HLFDMS) 
MACS) =ICW DO 11 J=NDA,2,~-1 
Icy=0 MACJPS¥CJ0 +200) -Z2 CJ-1) @ONEDS 

END IF 11 CONTINUE 

10 CONTINUE CW=DFLOAT (MA (1)4MB(1))42¢614) 


CY=DINT (CW#ONEDMS+HLFDM8) 
MA(1)=CW-CY*ONEDS 
3. How to Calculate x: Programming Aspects 


The actual programs, written in FORTRAN, consisted of 3426 (main) and 3642 (verification) lines of 
source code with comment lines. The numbers of program units are 59 (main) and 64 (verification). In 
this section we briefly overview the actual programming. 
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About the half of the sub-programs are routines related to multiple-precision multiplication. Others 
are routines for addition, subtraction, reciprocal operations, square root operations, file I/O operations, etc. 


The HITAC S-820 model 80 is a so called 2nd generation supercomputer in Japan. The machine is 
single processor model and has theoretical peak performance of 3Gflops. Maximum attachable main 
memory size is 512 Mb and Extended Storage size is 12 Gb. FFT routines, FFA and FFS, are carefully 
selected from the non-vectorized FORTRAN numerica! library which was provided by the Hitachi and 
source codes were slightly modified by myself for the vectorization. 


3.1. Layout of Storage 


As explained in section 2, we used a floating point real FFT whose accuracy is secured under the 
conditions of 


(1) 2 point double-precision floating point real FFT, 


(2) maximum number at each entry point for the first half entry is non negative number and 
must be bounded by 1,000, 


(3) the second half entry contains zero. 


These conditions allow in-core multiplication of numbers with 2% x 3 ( = 50,331,648) decimals. If we 
want more decimals to be calculated in-core (on main memory), we must reduce the above conditions, e.g. 


(1) 2! points -> 2?! points, 
(2) maximum number at each entry point of 1,000 -> 100. 


These new conditions would allow numbers of up to 27’ x 2 (= 1,073,741,824) decimals to be manipulated 
in-core. (We did not check the validity of these conditions through actual run. Our examples are only for 
explanation!) However, currently available maximum memory size prevent such higher precision calcula- 
tions in-core. In order to run with such ideal conditions, at least 8 GB of main memory should be avail- 
able. Now, the cases with 2!” points or 2! points and maximum number at each entry point of 10,000 
were applied for the previous our 7 calculations including 133 million decimals record. The increase of 
in-core operable FFT points is the major factor for the speed-up to the x calculation. 


It was impossible to obtain 200 million decimal places through in-core operations because of avail- 
able main memory size. Therefore we introduced the user controlled virtual memory scheme for saving 
high precision constants of 1/V2, V2, x, and several working storage with compression factor of 6 decimals 
/ 4 bytes. (integer representation was used for saving storages on the extended storage. For FFT, a 3 
decimals / 8 bytes scheme - double precision floating point representation - was needed as explained.) 
These schemes needed about 240 Mb of main memory for the working storage, input and output (1/0) 
buffer, object codes, etc. As for the extended storage size, 13.5 Mb / 1 million decimal places was needed. 


3.2. Optimization for Speedup 
We have employed several optimization schemes: 


1) Multiplication by 1 -> normal copy operation. 

2) Multiptication by 2 -> normal addition operation. 
3) Deletion of unnecessary FFA and FFS operations. 
4) Reuse of internal iteration results. 


For 1) and 2), explanation is simple enough. In the following two subsections, we explain the details of 3) 
and 4). 


3.2.1. How to Delete FFA and FFS Operations 


As explained in section 2.4, the school boy method was employed for multiple-precision multiplica- 
tion (for data length longer than in-core operable length). Now as for an example of 4 x 4 school boy 
method, let consider the simple case of 2 x 2. That corresponds to the multiplication of two integers, both 
with a length of 3 x2” and in-core operable length of 3 x 2%. Without optimization, each 2% x 1,000 
decimal based multiplication needs eight FFA for input and four FFS for output as follows; 


(A, x BASE + A) x (B, x BASE +B») 
=(FFS of (FFA of A,:FFA of Bj)) x BASE? 
+((FFS of (FFA of A, FFA of B2))+(FFS of (FFA of A: FFA of B,))) x BASE 


123 


160 9. 201,326,000 DECIMAL DIGITS OF PI CALCULATION (1988) 


+FFS of (FFA of Az- FFA of B2)), 


where - is the convolution product for the Fourier transformed data, + is the vector-wise addition and 
BASE is 2% x 1,000. 
There are very many FFA and FFS operations. These operations can be reduced a lot by the follow- 
ing schemes: 
1) First apply FFA operations to A;, A, B, and B>, Let the results be Aj’, Ay’, B,’ and By, 
respectively. 
2) Now (A)’: By’ + Ay’ « By’) preserve information in the sense explained in section 2.4, 
3) Then, the linearity of FFS operation satisfies the following equation: 
(A; X BASE + A>) X (B, x BASE + B) 
= (FFS of (Ai’ : By’) x BASE? + (FFS of (A,’ > Ba’ + Aq’: By')) x BASE 
+ (FFS of (A2’- Bo’). 


According to the results of CPU time profile analysis for 33 million decimal places calculation, multiplica- 
tion occupies 90% of CPU time. (Here, CPU time does not contain the time for input and output opera- 
tions.) Thus, this optimization is rather efficient when the length of multiplicand becomes long. (In this 
case, FFA operations was reduced from 8 to 4 and FFS operations was reduced from 4 to 3.) 


3.2.2. Reuse of Internal Iteration Results 


Newton’s iteration for square roots and reciprocals has second order convergence nature. This 
implies that internal iteration results at the calculation of the half-length precision of = can be the initial 
value for the iteration at the calculation of the full-length precision of x. (And this selection is the best 
selection for reducing CPU time.) According to measurements of CPU time, this scheme reduced the CPU 
time 10-20% for the 16 or 33 million decimal places calculation. 


If we utilize this fact, we can reduce the computation time probably by 10-20%. To do so, however, 
we had to prepare the permanent storage of about 1.5 Gb. This was completely impossible at the time of 
actual run. For the history of 2 calculation, we had saved the internal iteration results from the calculation 
of 16 million decimal places (data size is around 100 Mb). That data helped for the calculation of 32 mil- 
lion decimal places of x substantially. And also for the calculation of 16 million decimal places of 1, we 
had utilized the internal iteration results from the calculation of 8 million decimal places. 


4. Statistical Analysis of 200,000,000 Decimals of x 


In order to analyze the statistics of 200,000,000 decimals of , we used the same statistical tests as 
Pathria[13], except that the number of digits expanded from 100,000 to 200,000,000. In this section we 
briefly show the statistical data and some interesting figures from the 200,000,000 decimals of x. Now, the 
200,000,000-th decimal number of 1 ~— 3 is 9. 


4.1, Results of Statistical tests 


Five kinds of statistical tests were performed. These are the frequency test, the serial tests, the Poker 
hand test, the gap test and the five-digit sum test. We have reproduced only the results of frequency test in 
the Table 3, The other results are to be published through the reference[7]. 


4.2. Some Interesting Figures 
Analysis of digit sequences for 200,000,000 decimal places of x — 3 gives some interesting figures; 
1) A longest descending sequence of 2109876543 appears (from 26,160,634) only once. The next 


longest descending sequence is 876543210 (from 2,747,956) only. The next longest descending 
sequence of length 8 appears 9 times. 


2) The longest ascending sequence is 901234567 which appears from 197,090,144. The next 
longest ascending sequence is 23456789 (from 995,998), 89012345 (from 33,064,267, 
39,202,678, 62,632,993 and 78,340,559), 90123456 (from 35,105,378, 44,994,887, 98,647,533 
and 127,883,114), 56789012 (from 100,800,812 and 139,825,562), 67890123 (from 
102,197,548, 135,721,079 and 178,278,161), 01234567 (from 112,099,767), 78901234 (from 
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119,172,322, 122,016,838, 182,288,028 and 195,692,744), 12345678 (from 186,557,266) and 
45678901 (from 194,981,709). The next longest ascending sequence of length 7 appears 170 
times, 


3) A sequence of maximum multiplicity (of 9) appears 3 times. These are 7 (from 24,658,601), 6 
(from 45,681,781) and 8 (from 46,663,520). The next longest sequence of multiplicity (of 8) 
appears 16 times, 


4) The longest sequence of 27182818 appears from 73,154,827, 143,361,474 and 183,026,622. 
The next longest sequence of 2718281 appears 22 times. 


5) The longest sequence of 14142135 appears from 52,638, 10,505,872 and 143,965,527. The 
next longest sequence of 1414213 appears from 13,816,189, 40,122,589, 72,670,122, 
87,067,359, 104,717,213, 115,301,872, 145,035,762, 147,685,125, 155,299,021, 165,871,476, 
166,005,277, 166,491,213 and 191,208,533. The next longest sequence of 141421 appears 169 
times. 


6) The longest sequence of 31415926 appears 2 times. These are from 50,366,472 and 
157,060,182. The next longest sequence of 3141592 appears 7 times. 


5. Conclusion 


Details of 201,326,000 decimal digits of n calculation and some results of statistical tests on 
200,000,000 decimals of m are presented. The original program is written in FORTRAN 77 and heavily 
utilizes floating point operations. Multiple-precision add, subtract and constant multiplication programs 
were also vectorized through linear recurrence special instruction. Programs do not depend on the scheme 
of round-off and cut-off to the results for the floating point operations. Then, not only round-off machine, 
€.g. supercomputers of CRAY Inc. and ETA systems, but also cut-off machine, e.g. Japanese supercomput- 
ers, Can generate correct m digits. The program calls few system specific subroutines, e.g. CLOCK, TIME, 
but adaptation to the new machine is easy. In order to get speed, available main memory size is crucial 
and shorten elapsed time, availability of high speed I/O devices is also crucial. Thus, the 7m calculation 
program based on the scheme explained here can be a good benchmark program for the supercomputer. 


We have programmed the fast multiple-precision multiplication through a floating point real FFT 
package, which was available as one of the program libraries. This means that half a hundred lines of code 
is sufficient for the fast multiple-precision multiplication. (A few lines of array declaration, one line for 
calling the FFA routine, a few lines of convolution products, one line for calling the FFS routine and a few 
lines of normalization under a suitable base.) These schemes would be of benefit to other high-precision 
constant and function calculations. And another scheme, high utilization of floating point operations in the 
processing of integers, is also favorable for the integer calculation, especially to number crunchers. 


As the Table 1 shows, it took 12 years for extending the length of known 7 value from 100,000 to 


1,000,000, 10 years from 1,000,000 to 10,000,000 and 4 years from 10,000,000 to the order of 
100,000,000. When can we unveil the digits after 1,000,000,000, how and by whom? 
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(>=17526200) 17526200 
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(33554432) 33554414 
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Kubo etc. ri 133554000 (48h 2m) 


Kanada & Tamora HITAC S-820/80 | 1988 (201326572) 201326551 | Sh 57m L{B4) 
Jan. 201326000 (7h 30m) 


Table 1. Historical records of the calculation of x performed on electronic computers. M, K, G, S, L, R, 
B4 and B2 are the formulae of Machin, Klingenstierna, Gauss, St8rmer, and Gauss-Legendre, formula 
explained in the reference[3], Borwein’s quartic convergent, Borwein’s quadratic convergent, respectively. 
Symbol 'x’ means ’unknown’. Check time means the additional time for the calculated value checking. 
This information was basically obtained from Mr. Shibata(18]. 
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[ Brent et. al._| [ Borwein’s quadratic 


Table 2. Comparison of basic, time consuming, operations in the three historically important algorithms for 
nm calculation, Here n is a number of iterations. In order to compare with fare, both side columns numbers 
should be doubled. 


[look | 9699 | ris7_ | soe | sooas | oom _| roars | 1002 | roors | oars | 902 | 4.09 | 
aon [ance | amos [soe [pono | igre [anoy [ese [ares |e | ape 17a 
Sei) rat went | see anes | Aa pie oe | oe a 
[aot | orm [mess [oer | ares | aos | ase: | er | —onie | aoseer [ ano | 900] 
Tc en me [aren [ome | one [ame || ae [oe | ee [oe 


Eg 
1959343 
| zsm | zsooise | aeooois | asaorer | casvaia | aseasas | asooise | a19oson | zaveovo | aasorso | aaoosa2 | 5.28 | 
130m | asesisz | sooasse | sooones | 2vooaaa | soeassa | soosee7 | apeeses | avosins | aypsez | soooiss | 436 | 
[som | avoscea | sovzz20 | sooosrs | seneso | sooscon | aooover | aopsor7 | apoetss | asnesse | asserss | 6:17 | 
[som | ronson | soozres | scorers | ropress | eoonsas | ropssoo | roneics | ro0ase0 | sooosos | soorose | 5.95 | 
[oom | sessezz | soonas7s | sooner | ooseasa | soocanes | sovsere | sovaai7 | o99pci0 | soooarto | sosssas | 7.27 | 
[som | rasosseo | isooisso | isooises | iepoeiso | sooneze | rapsasea | vesmees« | asoouce | ssoorais | ssooon2 | 4:90 | 
| 200m | 9997437 | zoonsrrs | aooonies | zoooraio | 1999045 | s99ss001 | rspeeie: | aooonaxr | aooaser | amoonse2 | 4.13 | 


Table 3. Summary of frequency for the first 200,000,000 digits of x - 3 and corresponding x? values. 


127 


164 9. 201,326,000 DECIMAL DIGITS OF PI CALCULATION (1988) 


References 


1. 


21. 


128 


Bailey, D.H., ‘‘The Computation of mi to 29,360,000 Decimal Digits Using Borwein’s Quartically 
Convergent Algorithm,’’ Math. Comp., vol. 50, no. 181, pp. 66-88, Jan. 1988. 

Borwein, J.M. and Borwein, P.B., Pi and the AGM - A study in Analytic Number Theory and Compu- 
tational Complexity, A Wiley-Interscience Publication, New York, 1987. 

Borwein, J.M. and Borwein, P.B., ‘‘Ramanujan and Pi,’’ Scientific American, vol. 258, pp. 66-73, 
Feb, 1988. 

Brent, R-P., ‘‘Fast Multiple-Precision Evaluation of Elementary Functions,”’ J. ACM., vol. 23, no. 2, 
pp. 242-251, April 1976, 

Brent, R.P., ‘‘Multiple-Precision Zero-Finding Methods and the Complexity of Elementary Function 
Evaluation,’’ in Analytic Computational Complexity, ed. Traub, I.F., pp. 151-176, Academic Press, 
New York, 1976. 

Gentleman, W.M. and Sande, G., ‘‘Fast Fourier Transforms - For Fun and Profit,’’ AFIPS Conf. 
Proc. FICC, vol. 29, pp. 563-578, Spartan Books, San Francisco, Calif., November 1966. 

Kanada, Y., ‘‘Statistical Analysis of m up to 200,000,000 Decimal Places,” CCUT-TR-88-02, in 
preparation, Computer Centre, University of Tokyo, Bunkyo-ku, Yayoi 2-11-16, Tokyo 113, Japan, 
Nov, 1988. 

Tamura, Y., Yoshino, S., Ushiro, Y. and Kanada, Y., ‘‘Circular Constant - It’s High Speed Calcula- 
tion Method and Statistics -,”” in Proceedings Programming Symposium, in Japanese, Information 
Processing Society of Japan, Jan. 1984. 

Tamura, Y. and Kanada, Y., ‘Calculation of m to 4,194,293 Decimals Based on the Gauss-Legendre 
Algorithm,’? CCUT-TR-83-01, Computer Centre, University of Tokyo, Bunkyo-ku, Yayoi 2-11-16, 
Tokyo 113, Japan, May 1983. 

Kaneko, T. and Liu, B., ‘Accumulation of Round-Off Error in Fast Fourier Transforms,’’ /. ACM., 
vol. 17, no. 4, pp. 637-654, Oct. 1970, 


Knuth, D.E., The Art of Computer Programming, Vol. 2: Seminumerical Algorithms, Addison-Wesley, 
Reading, Mass., 1981. 


Laasonen, P., ‘‘On the Iterative Solution of the Matrix Equation A X°2 - I = 0,’ Math. Tables and 
Other Aids to Comp., vol. 12, pp. 109-116, 1958. 

Pathria, R.K., ‘‘A Statistical Study of Randomness Among the First 10,000 Digits of 1,” Math. 
Comp., vol. 16, pp. 188-197, 1962. 


Salamin, E., ‘‘Computation of 2 Using Arithmetic-Geometric Mean,’’ Math. Comp., vol. 30, no. 135, 
pp. 565-570, July 1976. 

Schiénhage, A. and Strassen, V., ‘‘Schnelle Multiplikation Grosser Zahlen,’’ Computing, vol. 7, pp. 
281-292, 1971. 

Schutz, G., ‘‘Iterative Berechnung der Reziproken Matrix,” Z. Angew. Math. Mech., vol. 13, pp. 57- 
59, 1933. 


Shanks, D. and Wrench, Jr. W., ‘‘Calculation of x to 100,000 Decimals,’’ Math. Comp., vol. 16, pp. 
76-79, 1962. 


Shibata, A., 1982-1983. Private communications 


Tamura, Y., ‘‘Calculation of 2 to 2 Million Decimals Using Legendre-Gauss’ Relation,’’ Proceeding 
of the International Latitude Observatory of Mizusawa No.22 (in Japanese) in press, Internal Latitude 
Observatory of Mizusawa, Hoshigaoka-cho 2-12, Mizusawa City, 023 Iwate Prefecture, Japan, 1983. 
Ushiro, Y., ‘Calculation of Multiple-Precision Numbers through Vectorizing Processor,” in Numeri- 
cal Calculation Workshop, in Japanese, Information Processing Society of Japan, 26-28 May. 1983. 
Kanada, Y., Tamura, Y., Yoshino, S. and Ushiro, Y., ‘‘Calculation of m to 10,013,395 Decimal 
Places Based on the Gauss-Legendre Algorithm and Gauss Arctangent Relation,’’ CCUT-TR-84-01, 
Computer Centre, University of Tokyo, Bunkyo-ku, Yayoi 2-11-16, Tokyo 113, Japan, Dec 1983. 


10. Ramanujan and pi (1988) 


Paper 10: Jonathan M. Borwein and Peter B. Borwein, “Ramanujan and Pi” 
Scientific American, vol. 256 (February 1988), 112-117. 


Synopsis: 

In this paper, Jonathan and Peter Borwein wrote about the recent develop- 
ments in the computation of 7, and their roots in the work of Archimedes, Gauss, 
Ramanujan and others, for a broad, semi-popular audience — Scientific American, 
which is read by several hundred thousand readers worldwide. 

One of the more interesting items in this article is its presentation of a full set 
of formulas enabling one to compute one billion digits of 7 on a “pocket calculator”: 
namely to iterate the Borwein quartically convergent algorithm 15 times. As they 
jokingly point out in a footnote, though, you would need a pretty big calculator or 
else the computation would be pretty uninteresting after the second iteration. 


Keywords: Algorithms, General Audience, History, Modular Equations 
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Ramanujan and Pi 


Some 75 years ago an Indian mathematical genius developed ways 
of calculating pi with extraordinary efficiency. His approach is now 
incorporated in computer algorithms yielding millions of digits of pi 


by Jonathan M. Borwein and Peter B. Borwein 


i, the ratio of any circle’s cir- 
Pp cumference to its diameter, was 

computed in 1987 to an unprec- 
edented level of accuracy: more than 
100 million decimal places. Last year 
also marked the centenary of the 
birth of Srinivasa Ramanujan, an en- 
igmatic Indian mathematical genius 
who spent much of his short life in 
isolation and poor health. The two 
events are in fact closely linked, be- 
cause the basic approach underlying 
the most recent Computations of pi 
was anticipated by Ramanujan, al- 
though its implementation had to 
await the formulation of efficient al- 
gorithms (by various workers includ- 
ing us), modern supercomputers and 
new ways to multiply numbers. 

Aside from providing an arena in 
which to set records of a kind, the 
quest to calculate the number to mil- 
lions of decimal places may seem 
rather pointless. Thirty-nine places 
of pi suffice for computing the cir- 
cumference of a circle girdling the 
known universe wilh an error no 
greater than the raclius of a hydrogen 
atom. It is hard to imagine physical 
situations requiring more digits. Why 
are mathematicians and computer 
scientists not satisfied with, say, the 
first 50 digits of pi? 

Several answers can be given. One 
is that the calculation of pi has be- 
come something of a benchmark 
computation: it serves as a measure 
of the sophistication and reliability of 
the computers that carry it out. In ad- 
dition, the pursuit of ever more ac- 
curate values of pi leads mathema- 
ticians to intriguing and unexpect- 
ed niches of number theory. Another 
and more ingenuous motivation is 
simply “because it's there.” In fact, pi 
has been a fixture of mathematical 
culture for more than two and a half 
millenniums. 

Furthermore, there is always the 
chance that such computations will 


2 


shed light on some of the riddles sur- 
rounding pi, a universal constant that 
is not particularly well understood, 
in spite of its relatively clementary 
nature. For example, although it has 
been proved that pi cannot ever be 
exactly evaluated by subjecting posi- 
tive integers to any combination of 
adding, subtracting, multiplying, di- 
viding or extracting roots, no one has 
succeeded in proving that the digits 
of pi follow a random distribution 
(such that each number from 0 to 9 
appears with equal frequency). It is 
possible, albeit highly unlikely, that 
after a while all the remaining digits 
of pi are 0's and 1's or exhibit some 
other regularity. Moreover, pi turns 
up in all kinds of unexpected places 
that have nothing to do with circles. 
Ifa number is picked at random from 
the set of integers, for instance, the 
probability that it will have no re- 
peated prime divisors is six divided 
by the square of pi. No different from 
olher eminent mathematicians, Ra- 
manujan was prey to the fascinations 
of the number. 


he ingredients of the recent 
approaches to calculating pi are 
among the mathematical treasures 
unearthed by renewed interest in Ra- 
manujan’s work. Much of what he 
did, however, is still inaccessible to 
investigators. The body of his work 
is contained in his “Notebooks,” 
which are personal records written 
in his own nomenclature. lo make 
matters more frustrating for math- 
ematicians who have studied the 
“Notebooks,” Ramanujan generally 
did not include formal proofs for his 
theorems. ‘The task of deciphering 
and editing the “Notebooks” is only 
now nearing completion, by Bruce C. 
Berndt of the University of Illinois at 
Urbana-Champaign. 
Yo our knowledge no mathemat- 
cal redaction of this scope or difficul- 
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ty has ever been attempted. The ef- 
fort is certainly worthwhile. Rama- 
nujan's legacy in the “Notebooks” 
promises not only to enrich pure 
mathematics but also to find applica- 
tion in various fields of mathematical 
physics. Rodney J. Baxter of the Aus- 
tralian National University, for exam- 
ple, acknowledges that Ramanujan’s 
findings helped him to solve such 
problems in statistical mechanics as 
the so-called hard-hexagon model, 
which considers the behavior of a 
system of interacting particles laid 
out on a honeycomblike grid. Simi- 
larly, Carlos J. Moreno of the City Uni- 
versity of New York and Freeman J. 
Dyson of the Institute for Advanced 
Study have pointed out that Ramanu- 
jan's work is beginning to be applied 
by physicists in superstring theory. 

Ramanujan’s stature as a mathe- 
matician is all the more astonishing 
when one considers his limited for- 
mal education. He was born on De- 
cember 22, 1887, into a somewhat 
impoverished family of the Brahmin 
caste in the town of Erode in south- 
ern India and grew up in Kumba- 
konam, where his father was an ac- 
countant to a Clothier. His mathemat- 
ical precocity was recognized carly, 
and at the age of seven he was given 
a scholarship to the Kumbakonam 
Town High School. He is said to have 
recited mathematical formulas to his 
schoolmates—including the value of 
pi to many places. 

When he was 12, Ramanujan mas- 
tered the contents of S. bE. Loney's 
rather comprehensive Plane Trigo- 
nometry, including its discussion of 
the sum and products of infinite se- 
quences, which later were to figure 
prominently in his work. (An infinite 
sequence is an unending string of 
terms, often generated by a simple 
formula. In this context the interest- 
ing sequences are those whose terms 
can be added or multiplied to yield 
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an identifiable, finite value. If the 
terms are added, the resulting ex- 
pression is called a series; if they are 
multiplied, it is called a product.) 
Three years later he borrowed the 
Synopsis of Elementary Results in Pure 
Mathematics, a listing of some 6,000 
theorems (most of them given with- 
out proof) compiled by G. S. Carr, a 
tutor at the University of Cambridge. 
Those two books were the basis of 
Ramanujan’s mathematical training. 

In 1903 Ramanujan was admitted to 
a local government college. Yet total 
absorption in his own mathematical 
diversions at the expense of every- 
thing else caused him to fail his ex- 
aminations, a pattern repeated four 
years later at another college in Ma- 
dras. Ramanujan did set his avoca- 
tion aside—if only temporarily—to 
look for a job after his marriage in 
1909. Fortunately in 1910 R. Rama- 
chandra Rao, a well-to-do patron of 
mathematics, gave him a monthly 
stipend largely on the strength of fa- 
vorable recommendations from vari- 
ous sympathetic Indian mathemati- 
cians and the findings he already had 
jotted down in the “Notebooks.” 

In 1912, wanting more convention- 
al work, he took a clerical position 
in the Madras Port Trust, where the 
chairman was a British engineer, Sir 
Francis Spring, and the manager was 
V. Ramaswami Aiyar, the founder 
of the Indian Mathematical Socie- 
ty. They encouraged Ramanujan to 
communicate his results to three 
prominent British mathematicians. 
Two apparently did not respond; the 
one who did was G. I. Hardy of Cam- 
bridge, now regarded as the foremost 
British mathematician of the period. 


He, accustomed to receiving 
crank mail, was inclined to dis- 
regard Ramanujan’s letter at first 
glance the day it arrived, January 
16, 1913. But after dinner that night 
Hardy and a close colleague, John 
E. Littlewood, sat down to puzzle 
through a list of 120 formulas and the- 
orems Ramanujan had appended to 
his letter. Some hours later they had 
reached a verdict: they were seeing 
the work of a genius and not a crack- 
pot. (According to his own “pure-tal- 
ent scale” of mathematicians, Hardy 
was later to rate Ramanujan a 100, 
Littlewood a 30 and himself a 25. The 
German mathematician David Hil- 
bert, the most influential figure of the 
time, merited only an 80.) Hardy de- 
scribed the revelation and its conse- 
quences as the one romantic incident 
in his life. He wrote that some of 
Ramanujan’s formulas defeated him 
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completely, and yet “they must be 
true, because if they were not true, 
no one would have had the imagina- 
tion to invent them.” 

Hardy immediately invited Rama- 
nujan to come to Cambridge. In spite 
of his mother’s strong objections as 
well as his own reservations, Rama- 
nujan set out for England in March of 
1914. During the next five years Har- 
dy and Ramanujan worked together 
at Trinity College. The blend of Har- 
dy’s technical expertise and Rama- 
nujan’s raw brilliance produced an 
unequaled collaboration. They pub- 
lished a series of seminal papers on 
the properties of various arithmetic 
functions, laying the groundwork for 
the answer to such questions as: How 
many prime divisors is a given num- 
ber likely to have? How many ways 
can one express a number as a sum 
of smaller positive integers? 


In 1917 Ramanujan was made a Fel- 
low of the Royal Society of London 
and a Fellow of Trinity College—the 
first Indian to be awarded either hon- 
or. Yet as his prominence grew his 
health deteriorated sharply, a de- 
cline perhaps accelerated by the dif- 
ficulty of maintaining a strict vege- 
tarian diet in war-rationed England. 
Although Ramanujan was in and 
out of sanatoriums, he continued to 
pour forth new results. In 1919, when 
peace made travel abroad safe again, 
Ramanujan returned to India. Al- 
ready an icon for young Indian intel- 
lectuals, the 32-year-old Ramanujan 
died on April 26, 1920, of what was 
then diagnosed as tuberculosis but 
now is thought to have been a severe 
vitamin deficiency. True to mathe- 
matics until the end, Ramanujan did 
not slow down during his last, pain- 
racked months, producing the re- 


SRINIVASA RAMANUJAN, born in 1887 in India, managed in spite of limited formal 
education to reconstruct almost single-handedly much of the edifice of number theory 
and to go on to derive original theorems and formulas. Like many illustrious mathema- 
ticians before him, Ramanujan was fascinated by pi: the ratio of any circle’s circumfer- 
ence to its diameter. Based on his investigation of modular equations (see box on page 
114), he formulated exact expressions for pi and derived from them approximate val- 
ues. As a result of the work of various investigators (including the authors), Ramanu- 
jan’s methods are now better understood and have been implemented as algorithms. 
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markable work recorded in his so- 
called “Lost Notebook.” 


amanujan’s work on pi grew in 
large part out of his investigation 
of modular equations, perhaps the 
most thoroughly treated subject in 
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the “Notebooks.” Roughly speaking, 
a modular equation is an algebra- 
ic relation between a function ex- 
pressed in terms of a variable x—in 
mathematical notation, f(x)—and the 
same function expressed in terms of 
x raised to an integral power, for ex- 


PERIMETER OF 
CIRCUMSCRIBED PERIMETER OF 
POLYGON INSCRIBED POLYGON 
feantan (iP) WHERE F=nsin (1?) 
n= NUMBER 


R= 3.464: 


Pe= 3.215" 


R=3.159- 


OF SIDES 


P= 3132 


ARCHIMEDES’ METHOD for estimating pi relied on inscribed and circumscribed regu- 
lar polygons (polygons with sides of equal length) on a circle having a diameter of one 
unit (or a radius of half a unit). The perimeters of the inscribed and circumscribed poly- 
gons served respectively as lower and upper bounds for the value of pi. The sine and 
tangent functions can be used to calculate the polygons’ perimeters, as is shown here, 
but Archimedes had to develop equivalent relations based on geometric constructions. 
Using 96-sided polygons, he determined that pi is greater than 3'°/7: and less than 3?/7. 
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ample f(x’), f(x’) or f(x*). The “order” 
of the modular equation is given by 
the integral power. The simplest 
modular equation is the second-or- 
der one: f(x)=2 Vf(x") /[1 + f(x?)]. Of 
course, not every function will satis- 
fy a modular equation, but there is 
a class of functions, called modular 
functions, that do. These functions 
have various surprising symmetries 
that give them a special place in 
mathematics. 

Ramanujan was unparalleled in his 
ability to come up with solutions to 
modular equations that also satisfy 
other conditions. Such solutions are 
called singular values. It turns out 
that solving for singular values in 
certain cases yields numbers whose 
natural logarithms coincide with pi 
(times a constant) to a surprising 
number of places [see box on page 
114). Applying this general approach 
with extraordinary virtuosity, Rama- 
nujan produced many femarkable in- 
finite series as well as single-term ap- 
proximations for pi. Some of them 
are given in Ramanujan’s one formal 
paper on the subject, Modular Equa- 
tions and Approximations to tr, pub- 
lished in 1914. 

Ramanujan’s attempts to approxi- 
mate pi are part of a venerable tradi- 
tion. The earliest Indo-European civi- 
lizations were aware that the area of 
a circle is proportional to the square 
of its radius and that the circumfer- 
ence of a circle is directly proportion- 
al to its diameter. Less clear, howev- 
er, is when it was first realized that 
the ratio of any circle’s circumfer- 
ence to its diameter and the ratio of 
any circle’s area to the square of its 
radius are in fact the same constant, 
which today is designated by the 
symbol tr. (The symbol, which gives 
the constant its name, is a latecomer 
in the history of mathematics, having 
been introduced in 1706 by the En- 
glish mathematical writer William 
Jones and popularized by the Swiss 
mathematician Leonhard Euler in the 
18th century.) 


Fe cobs of Syracuse, the great- 
est mathematician of antiquity, 
rigorously established the equiva- 
lence of the two ratios in his treatise 
Measurement of a Circle. He also cal- 
culated a value for pi based on math- 
ematical principles rather than on di- 
rect measurement of a circle’s cir- 
cumference, area and diameter. What 
Archimedes did was to inscribe and 
circumscribe regular polygons (poly- 
gons whose sides are all the same 
length) on a circle assumed to have a 
diameter of one unit and to consider 
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the polygons’ respective perimeters 
as lower and upper bounds for possi- 
ble values of the circumference of the 
circle, which is numerically equal to 
pi [see illustration on opposite page|. 

This method of approaching a val- 
ue for pi was not novel: inscribing 
polygons of ever more sides in a cir- 
cle had been. proposed earlier by 
Antiphon, and Antiphon’s contempo- 
rary, Bryson of Heraclea, had added 
circumscribed polygons to the proce- 
dure. What was novel was Archime- 
des’ correct determination of the ef- 
fect of doubling the number of sides 
on both the circumscribed and the in- 
scribed polygons. He thereby devel- 
oped a procedure that, when repeat- 
ed enough times, enables one in prin- 
ciple to calculate pi to any number of 
digits. (It should be pointed out that 
the perimeter of a regular polygon 
can be readily calculated by means 
of simple trigonometric functions: 
the sine, cosine and tangent func- 
tions. But in Archimedes’ time, the 
third century B.c., such functions 
were only partly understood. Archi- 
medes therefore had to rely mainly 
on geometric constructions, which 
made the calculations considerably 
more demanding than they might ap- 
pear today.) 

Archimedes began with inscribed 
and circumscribed hexagons, which 
yield the inequality 3<m<2 V3. By 
doubling the number of sides four 
times, to 96, he narrowed the range 
of pi to between 3!%: and 3%, obtain- 
ing the estimate 13.14. There is 
some evidence that the extant text 
of Measurement of a Circle is only a 
fragment of a larger work in which 
Archimedes described how, starting 
with decagons and doubling them six 
times, he got a five-digit estimate: 
T= 3.1416. 

Archimedes’ method is conceptu- 
ally simple, but in the absence of a 
ready way to calculate trigonometric 
functions it requires the extraction of 
roots, which is rather time-consum- 
ing when done by hand. Moreover, 
the estimates converge slowly to pi: 
their error decreases by about a fac- 
tor of four per iteration. Neverthe- 
less, all European attempts to calcu- 
late pi before the mid-17th century 
relied in one way or another on 
the method. The 16th-century Dutch 
mathematician Ludolph van Ceulen 
dedicated much of his career to a 
computation of pi. Near the end of his 
life he obtained a 32-digit estimate 
by calculating the perimeter of in- 
scribed and circumscribed polygons 
having 2° (some 10’) sides. His val- 
ue for pi, called the Ludolphian num- 
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ber in parts of Europe, is said to have 
served as his epitaph. 


he development of calculus, 
largely by Isaac Newton and 
Gottfried Wilhelm Leibniz, made it 
possible to calculate pi much more 
expeditiously. Calculus provides ef- 
ficient techniques for computing 
a function’s derivative (the rate of 
change in the function’s value as its 
variables change) and its integral 
(the sum of the function’s:-values over 
a range of variables). Applying the 
techniques, one can demonstrate 
that inverse trigonometric functions 
are given by integrals of quadratic 
functions that describe the curve ofa 
circle. (The inverse of a trigonomet- 
ric function gives the angle that cor- 
responds to a particular value of the 
function. For example, the inverse 
tangent of 1 is 45 degrees or, equiva- 
lently, 1/4 radians.) 
(The underlying connection be- 


tween trigonometric functions and 
algebraic expressions can be appre- 
ciated by considering a circle that 
has aradius of one unit and its center 
at the origin of a Cartesian x-y plane. 
The equation for the circle—whose 
area is numerically equal to pi—is 
x*+ y= 1, which is a restatement of 
the Pythagorean theorem for a right 
triangle with a hypotenuse equal to 
1. Moreover, the sine and cosine of 
the angle between the positive x axis 
and any point on the circle are equal 
respectively to the point’s coordi- 
nates, y and x; the angle’s tangent is 
simply y/x.) 

Of more importance for the pur- 
poses of calculating pi, however, is 
the fact that an inverse trigonometric 
function can be “expanded” as a se- 
ries, the terms of which are comput- 
able from the derivatives of the func- 
tion. Newton himself calculated pi 
to 15 places by adding the first few 
terms of a series that can be derived 


MACHIN'S FORMULA (1706) 
== 4 arctan (1/5) — arctan (1/239), 


RAMANUJAN (1914) 


TT 


BORWEIN AND BORWEIN (1987) 


1 


WALLIS’ PRODUCT (1665) 
m_2x2, 4x4, 6x6, 8x8, _4n? 
2 1x3" 3x5" 5x7 7x9 Ak an2-) 
GREGORY'S SERIES (1671) 
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7 mame FA 21 ni 


where arctan X = X— ats 


x3. X5 X7 Ss x(2n+1) 
ee cee 
ars: 


1_ V8 & (4n)![1103 + 26,390n _ 2 . a s 
1-3, ai M03, + 75.3809), where n! = 1 x (n-1) x (n-2) x = x TandO! =1 
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TERMS OF MATHEMATICAL SEQUENCES can be summed or multiplied to yield values 
for pi (divided by a constant) or its reciprocal. The first two sequences, discovered re- 
spectively by the mathematicians John Wallis and James Gregory, are probably among 
the best-known, but they are practically useless for computational purposes. Not even 
100 years of computing on a supercomputer programmed to add or multiply the terms 
of either sequence would yield 100 digits of pi. The formula discovered by John Machin 
made the calculation of pi feasible, since calculus allows the inverse tangent (arc tan- 
gent) of a number, x, to be expressed in terms of a sequence whose sum converges 
more rapidly to the value of the arc tangent the smaller xis. Virtually all calculations for 
pi from the beginning of the 18th century until the early 1970’s have relied on varia- 
tions of Machin’s formula. The sum of Ramanujan's sequence converges to the true val- 
ue of 1/1 much faster: each successive term in the sequence adds roughly eight more 
correct digits. The last sequence, formulated by the authors, adds about 25 digits per 
term; the first term (for which n is 0) yields a number that agrees with pi to 24 digits. 
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as an expression for the inverse of 
the sine function. He later confessed 
to a colleague: “I am ashamed to tell 
you to how many figures | carried 
these calculations, having no other 
business at the time.” 

In 1674 Leibniz derived the formula 
1-1/3+1/5-1/7...=11/4, which is 
the inverse tangent of 1. (The general 
inverse-tangent series was originally 
discovered in 1671 by the Scottish 
mathematician James Gregory. In- 
deed, similar expressions appear to 
have been developed independently 
several centuries earlier in India.) 
The error of the approximation, de- 
fined as the difference between the 
sum of n terms and the exact value of 
1/4, is roughly equal to the n+1th 
term in the series. Since the denomi- 
nator of each successive term in- 
creases by only 2, one must add ap- 
proximately 50 terms to get two-digit 
accuracy, 500 terms for three-digit 
accuracy and so on. Summing the 
terms of the series to calculate a val- 
ue for pi more than a few digits long 
is clearly prohibitive. 

An observation made by John Ma- 
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chin, however, made it practicable to 
calculate pi by means of a series ex- 
pansion for the inverse-tangent func- 
tion. He noted that pi divided by 4 is 
equal to 4 times the inverse tangent 
of 1/5 minus the inverse tangent of 
1/239. Because the inverse-tangent 
series for a given value converges 
more quickly the smaller the value 
is, Machin’s formula greatly simpli- 
fied the calculation. Coupling his for- 
mula with the series expansion for 
the inverse tangent, Machin comput- 
ed 100 digits of pi in 1706. Indeed, his 
technique proved to be so powerful 
that all extended calculations of pi 
from the beginning of the 18th centu- 
ry until recently relied on variants of 
the method. 


wo 19th-century calculations de- 

serve special mention. In 1844 Jo- 
hann Dase computed 205 digits of pi 
in a matter of months by calculating 
the values of three inverse tangents 
in a Machin-like formula. Dase was a 
calculating prodigy who could mul- 
tiply 100-digit numbers entirely in 
his head—a feat that took him rough- 


where 


kp = Vale- #0) 


MODULAR FUNCTIONS AND APPROXIMATIONS TO PI 
A modular function is a function, A(q), that can be related through an algebraic expression 
called a modular equation to the same function expressed in terms of the same variable, q, 


raised to an integral power: A(q°). The integral power, p, determines the “order” of the 
modular equation. An example of a modular function is 


A 1+q2n \3 

NQ)i= 169 Il G + qa) - 

Its associated seventh-order modular equation, which relates \(q) to A(q?), is given by 
WAG) +Y/TT— XQ) — Ng) = 1. 


Singular values are solutions of modular equations that must also satisfy additional condi- 
tions. One class of singular values corresponds to computing a sequence of values, k, 


‘pr 


and p takes integer values. These values have the curious property that the logarithmic 


expression 


=2 tog( ke 
vp 3GQ) 


coincides with many of the first digits of pi. The number of digits the expression has in 
common with pi increases with larger values of p. 


Ramanujan was unparalleled in his ability to calculate these singular values. One of his most 
famous is the value when p equals 210, which was included in his original letter to G. H. 


Hardy. It is 


kayo = (V2—-1)*(2-V3)(V7 — V6)2(8-3.V 7('V 10 — 3)(V 15 — V 14)(4— V15)2(6 - 35) . 


This number, when plugged into the logarithmic expression, agrees with pi through the first 
20 decimal places. In comparison, k2« yields a number that agrees with pi through more 


than one million digits. 


Applying this general approach, Ramanujan constructed a number of remarkable series for 
pi, including the one shown in the illustration on the preceding page. The general approach 
also underlies the two-step, iterative algorithms in the top illustration on the opposite page. 
In each iteration the first step (calculating y,) corresponds to computing one of a sequence 
of singular values by solving a modular equation of the appropriate order; the second step 
(calculating «,) is tantamount to taking the logarithm of the singular value. 
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ly eight hours. (He was perhaps 
the closest precursor of the modern 
supercomputer, at least in terms of 
memory capacity.) In 1853 William 
Shanks outdid Dase by publishing 
his computation of pi to 607 places, 
although the digits that followed 
the 527th place were wrong. Shank’s 
task took years and was a rather rou- 
tine, albeit laborious, application of 
Machin’'s formula. (In what must itself 
be some kind of record, 92 years 
passed before Shank’s error was de- 
tected, in a comparison between his 
value and a 530-place approximation 
produced by D. F. Ferguson with the 
aid of amechanical calculator.) 

The advent of the digital comput- 
er saw a renewal of efforts to cal- 
culate ever more digits of pi, since 
the machine was ideally suited for 
lengthy, repetitive “number crunch- 
ing.” ENIAC, one of the first digital 
computers, was applied to the task in 
June, 1949, by John von Neumann 
and his colleagues. ENIAC produced 
2,037 digits in 70 hours. In 1957 G. E. 
Felton attempted to compute 10,000 
digits of pi, but owing to a machine 
error only the first 7,480 digits were 
correct. The 10,000-digit goal was 
reached by F. Genuys the following 
year on an IBM 704 computer. In 1961 
Daniel Shanks and John W. Wrench, 
Jr., calculated 100,000 digits of pi in 
less than nine hours on an IBM 7090. 
The million-digit mark was passed in 
1973 by Jean Guilloud and M. Bouyer, 
a feat that took just under a day of 
computation on a CDC 7600. (The 
computations done by Shanks and 
Wrench and by Guilloud and Bouyer 
were in fact carried out twice using 
different inverse-tangent identities 
for pi. Given the history of both hu- 
man and machine error in these cal- 
culations, it is only after such verifi- 
cation that modern “digit hunters” 
consider a record officially set.) 

Although an increase in the speed 
of computers was a major reason 
ever more accurate calculations for 
pi could be performed, it soon be- 
came clear that there were inescap- 
able limits. Doubling the number of 
digits lengthens computing time by 
at least a factor of four, if one applies 
the traditional methods of perform- 
ing arithmetic in computers. Hence 
even allowing for a hundredfold in- 
crease in computational speed, Guil- 
loud and Bouyer’s program would 
have required at least a quarter cen- 
tury to produce a billion-digit value 
for pi. From the perspective of the 
early 1970’s such a computation did 
notseem realistically practicable. 

Yet the task is now feasible, thanks 
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not only to faster computers but also 
to new, efficient methods for mul- 
tiplying large numbers in comput- 
ers. A third development was also 
crucial: the advent of iterative algo- 
rithms that quickly converge to pi. 
(An iterative algorithm can be ex- 
pressed as a computer program that 
repeatedly performs the same arith- 
metic operations, taking the output 
of one cycle as the input for the next.) 
These algorithms, some of which we 
constructed, were in many respects 
anticipated by Ramanujan, although 
he knew nothing of computer pro- 
gramming. Indeed, computers not 
only have made it possible to apply 
Ramanujan’s work but also have 
helped to unravel it. Sophisticated 
algebraic-manipulation software has 
allowed further exploration of the 
road Ramanujan traveled alone and 
unaided 75 years ago. 


ne of the interesting lessons of 

theoretical computer science is 
that many familiar algorithms, such 
as the way children are taught to 
multiply in grade school, are far from 
optimal. Computer scientists gauge 
the efficiency of an algorithm by 
determining its bit complexity: the 
number of times individual digits are 
added or multiplied in carrying out 
an algorithm. By this measure, add- 
ing two n-digit numbers in the nor- 
mal way has a bit complexity that in- 
creases in step with n; multiplying 
two n-digit numbers in the normal 
way has a bit complexity that in- 
creases as n’. By traditional methods, 
multiplication is much “harder” than 
addition in that itis much more time- 
consuming. 

Yet, as was shown in 1971 by A. 
Schénhage and V. Strassen, the mul- 
tiplication of two numbers can in the- 
ory have a bit complexity only a little 
greater than addition. One way to 
achieve this potential reduction in bit 
complexity is to implement so-called 
fast Fourier transforms (FFT’s). FFT- 
based multiplication of two large 
numbers allows the intermediary 
computations among individual dig- 
its to be carefully orchestrated so 
that redundancy is avoided. Because 
division and root extraction can be 
reduced to a sequence of multiplica- 
tions, they too can have a bit com- 
plexity just slightly greater than that 
of addition. The result is a tremen- 
dous saving in bit complexity and 
hence in computation time. For this 
reason all recent efforts to calculate 
pi rely on some variation of the rrr 
technique for multiplication. 

Yet for hundreds of millions of dig- 
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ITERATIVE ALGORITHMS that yield extremely accurate values of pi were developed 
by the authors. (An iterative algorithm is a sequence of operations repeated in such a 
way that the ouput of one cycle is taken as the input for the next.) Algorithm a con- 
verges to 1/1 quadratically: the number of correct digits given by «, more than dou- 
bles each time nis increased by L Algorithm b converges quartically and algorithm c 
converges quintically, so that the number of coinciding digits given by each iteration 
increases respectively by more than a factor of four and by more than a factor of five. 
Algorithm bis possibly the most efficient known algorithm for calculating pi; it was run 
on supercomputers in the last three record-setting calculations. As the authors worked 
on the algorithms it became clear to them that Ramanujan had pursued similar meth- 
ods in coming up with his approximations for pi. In fact, the computation of s, in algo- 
rithm crests ona remarkable fifth-order modular equation discovered by Ramanujan. 
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NUMBER OF KNOWN DIGITS of pi has increased by two orders of magnitude (factors 
of 10) in the past decade as a result of the development of iterative algorithms that can 
be run on supercomputers equipped with new, efficient methods of multiplication. 
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its of pi to be calculated practically a 
beautiful formula known a century 
and a half earlier to Carl Friedrich 
Gauss had to be rediscovered. In the 
mid-1970’s Richard P. Brent and Eu- 
gene Salamin independently noted 
that the formula produced an algo- 
rithm for pi that converged quadrati- 
cally, that is, the number of digits 
doubled with each iteration. Between 
1983 and the present Yasumasa Ka- 
nada and his colleagues at the Uni- 
versity of Tokyo have employed this 
algorithm to set several world rec- 
ords for the number of digits of pi. 

We wondered what underlies the 
remarkably fast convergence to pi of 
the Gauss-Brent-Salamin algorithm, 
and in studying it we developed gen- 
eral techniques for the construction 
of similar algorithms that rapidly 
converge to pi as well as to other 
quantities. Building on a theory out- 
lined by the German mathematician 
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Karl Gustav Jacob Jacobi in 1829, we 
realized we could in principle arrive 
at a value for pi by evalyating inte- 
grals of a class called elliptic inte- 
grals, which can serve to calculate 
the perimeter of an ellipse. (A circle, 
the geometric setting of previous ef- 
forts to approximate pi, is simply an 
ellipse with axes of equal length.) 


Elliptic integrals cannot generally. 


be evaluated as integrals, but they 
can be easily approximated through 
iterative procedures that rely on 
modular equations. We found that 
the Gauss-Brent-Salamin algorithm is 
actually a specific case of our more 
general technique relying on a sec- 
ond-order modular equation. Quick- 
er convergence to the value of the in- 
tegral, and thus a faster algorithm for 
pi, is possible if higher-order modu- 
lar equations are used, and so we 
have also constructed various algo- 
rithms based on modular equations 


Yo= V2-1 


Yr = [1-V1—yoA/1 411 —yo4] 


y2 = (1-1, 411 41 -y,4] 


yz = [1-1 - yA 1 ya] 


= [1-V1-y3411 471-34] 


ys = (1-1 yA 41 yah] 


Yo = [1-1 ~yo 4/1 4/1 — yo] 
V7 = [11 -V1 yet 41 — yo 
yg = [1-1 -y,4V11 1-4] 
Yo = [1-1 yg 4/11 +01 — yo 4] 


Yio = [1-V1=yo4 V1 0/1 — yo] 

Yan = W111 = y94] 
Yi2 = NV y 1 1-94] 
vig = (1-01 yq94 V1 481 — yy] 
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1-y34V1 1-434] 


yas = (1-1 yg +1 —yqa4] 


HOW TO GET TWO BILLION DIGITS OF PI 
WITH A CALCULATOR* 


ag = 6-4V2 


ay = (1+ y1)4 ag —23y4(1 +4 +42) 


ag = (1 +y2)4 a4 —25y2(1 +2 +y2") 

ag = (1+y3)4 a2 ~ 27y3(1 +y3+y3") 

ag = (1+ ya)4 03 —29yql1 +¥q+¥a") 

as = (1 +y5)4 ag—2"ya(1 +¥5 + Ys) 

ag = (1 +y¢)4 a5 — 213y6(1 +6 + ye") 

ag = (1+y7)4 ag 2! y9(1 +7 +97?) 

ag = (1+yg)4 «7-2 yg(1 + yg + Ye") 

ag = (1+Y9)4 ag - 2! 9y9(1 +¥9 +o") 

a49 = (1+yr9)4 ag —271yag(1 + y10 + ¥10”) 
ayy = (1+ y99)4 49 — 223 y44(1 + ¥45 +112) 
eg = (1+ y92)4 44 — 275yq2(1 + 12+ Ya2”) 
a3 = (1 +y43)4 eq —227y43(1 +13 + ¥13”) 


ag = (1+y rq) 043 — 22%y4q(1 +y1q+ ¥142) 


ays = (1+ y15)4 ayg - 234y45(1 + y45 + ¥15") 


1/a4s agress with 7 for more than two billion decimal digits 


*Of course, the calculator needs to have a two-billion-digit display; on a pocket calcu- 
lator the computation would not be very interesting after the second iteration. 


EXPLICIT INSTRUCTIONS for executing algorithm b in the top illustration on the pre- 
ceding page makes it possible in principle to compute the first two billion digits of piin 
a matter of minutes. All one needs is a calculator that has two memory registers and the 
usual capacity to add, subtract, multiply, divide and extract roots. Unfortunately most 
calculators come with only an eight-digit display, which makes the computation moot. 
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of third, fourth and higher orders. 
In January, 1986, David H. Bailey of 
the National Aeronautics and Space 
Administration’s Ames Research 
Center produced 29,360,000 decimal 
places of pi by iterating one of our 
algorithms 12 times on a Cray-2 su- 
percomputer. Because the algorithm 
is based on a fourth-order modular 
equation, it converges on pi quar- 
tically, more than quadrupling the 
number of digits with each iteration. — 
A year later Kanada and his col- 
leagues carried out one more it- 
eration to attain 134,217,000 places 
on an NEC SX-2 supercomputer and 
thereby verified a similar computa- 
tion they had done earlier using the 
Gauss-Brent-Salamin algorithm. (Iter- 
ating our algorithm twice more—a 
feat entirely feasible if one could 
somehow monopolize a supercom- 
puter for a few weeks—would yield 
more than two billion digits of pi.) 


terative methods are best suited 

for calculating pi on a computer, 
and so itis not surprising that Rama- 
nujan never bothered to pursue 
them. Yet the basic ingredients of the 
iterative algorithms for pi—modular 
equations in particular—are to be 
found in Ramanujan’s work. Parts of 
his original derivation of infinite se- 
ries and approximations for pi more 
than three-quarters of a century ago 
must have paralleled our own efforts 
to come up with algorithms for pi. In- 
deed, the formulas he lists in his pa- 
per on pi and in the “Notebooks” 
helped us greatly in the construction 
of some of our algorithms. For exam- 
ple, although we were able to prove 
that an llth-order algorithm exists 
and knew its general formulation, it 
was not until we stumbled on Rama- 
nujan’s modular equations of the 
same order that we discovered its un- 
expectedly simple form. 

Conversely, we were also able to 
derive all Ramanujan’s series from 
the general formulas we had devel- 
oped. The derivation of one, which 
converged to pi faster than any other 
series we knew at the time, came 
about with a little help from an unex- 
pected source. We had justified all 
the quantities in the expression for 
the series except one: the coefficient 
1,103, which appears in the numera- 
tor of the expression [see illustration 
on page 113). We were convinced— 
as Ramanujan must have been—that 
1,103 had to be correct. To prove it 
we had either to simplify a daunting 
equation containing variables raised 
to powers of several thousand or 
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to delve considerably further into 
somewhat arcane number theory. 

By coincidence R. William Gosper, 
Jr., of Symbolics, Inc., had decided in 
1985 to exploit the same series of Ra- 
manujan’s for an extended-accuracy 
value for pi. When he carried out the 
calculation to more than 17 million 
digits (arecord at the time), there was 
to his knowledge no proof that the 
sum of the series actually converged 
to pi. Of course, he knew that mil- 
lions of digits of his value coincided 
with an earlier Gauss-Brent-Salamin 
calculation done by Kanada. Hence 
the possibility of error was vanish- 
ingly small. 

As soon as Gosper had finished his 
calculation and verified it against 
Kanada’s, however, we had what we 
needed to prove that 1,103 was the 
number needed to make the series 
true to within one part in 10!9,000,000, 
In much the same way that a pair of 
integers differing by less than 1 must 
be equal, his result sufficed to specify 
the number: it is precisely 1,103. In ef- 
fect, Gosper’s computation became 
part of our proof. We knew that the 
series (and its associated algorithm) 
is so sensitive to slight inaccuracies 
that if Gosper had used any other val- 
ue for the coefficient or, for that mat- 
ter, if the computer had introduced 
a single-digit error during the calcu- 
lation, he would have ended up 
with numerical nonsense instead of 
a value for pi. 

Ramanujan-type algorithms for ap- 
proximating pi can be shown to be 
very close to the best possible. If all 
the operations involved in the exe- 
cution of the algorithms are totaled 
(assuming that the best techniques 
known for addition, multiplication 
and root extraction are applied), the 
bit complexity of computing n digits 
of pi is only marginally greater than 
that of multiplying two n-digit num- 
bers. But multiplying two n-digit 
numbers by means of an FFT-based 
technique is only marginally more 
complicated than summing two n- 
digit numbers, which is the simplest 
of the arithmetic operations possible 
on acomputer. 


athematics has probably not yet 

felt the full impact of Ramanu- 
jan’s genius. There are many other 
wonderful formulas contained in the 
“Notebooks” that revolve around 
integrals, infinite series and contin- 
ued fractions (a number plus a frac- 
tion, whose denominator can be 
expressed as a number plus a frac- 
tion, whose denominator can be ex- 
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RAMANUJAN'S “NOTEBOOKS” were personal records in which he jotted down many 
of his formulas. The page shown contains various third-order modular equations—all 
in Ramanujan’s nonstandard notation. Unfortunately Ramanujan did not bother to in- 
clude formal proofs for the equations; others have had to compile, edit and prove them. 
The formulas in the “Notebooks” embody subtle relations among numbers and func- 
tions that can be applied in other fields of mathematics or even in theoretical physics. 


pressed as a number plus a fraction, 
and so on). Unfortunately they are 
listed with litthe—if any—indication 
of the method by which Ramanujan 
proved them. Littlhewood wrote: “If 
a significant piece of reasoning oc- 
curred somewhere, and the total mix- 
ture of evidence and intuition gave 
him certainty, he looked no further.” 

The herculean task of editing the 
“Notebooks,” initiated 60 years ago 
by the British analysts G. N. Watson 
and B. N. Wilson and now being com- 
pleted by Bruce Berndt, requires pro- 
viding a proof, a source or an occa- 
sional correction for each of many 
thousands of asserted theorems and 
identities. A single line in the “Note- 
books” can easily elicit many pages 
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of commentary. The task is made all 
the more difficult by the nonstandard 
mathematical notation in which the 
formulas are written. Hence a great 
deal of Ramanujan’s work will not 
become accessible to the mathemat- 
ical community until Berndt’s proj- 
ectis finished. 

Ramanujan’s unique capacity for 
working intuitively with complicated 
formulas enabled him to plant seeds 
in a mathematical garden (to borrow 
a metaphor from Freeman Dyson) 
that is only now coming into bloom. 
Along with many other mathema- 
ticians, we look forward to seeing 
which of the seeds will germinate in 
future years and further beautify the 
garden. 
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11. Ramanujan, modular equations, and 
approximations to pi or how to compute one 
billion digits of pi (1989) 


Paper 11: Jonathan M. Borwein, Peter B. Borwein, and David H. Bailey, “Ra- 
manujan, modular equations, and approximations to pi, or how to compute one 
billion digits of pi,” American Mathematical Monthly, vol. 96 (1989), p. 201-219. 
Copyright 1989 Mathematical Association of America. All Rights Reserved. 


Synopsis: 

Here the authors reprise the history of a, describe the recently discovered 
quadratically convergent algorithms, describe (briefly) the computational tech- 
niques required to compute, say, one billion digits of 7, and then explore in con- 
siderable detail the connections between these mathematical developments and the 
recently uncovered writings of Ramanujan, the hundredth anniversary of whose 
birth was celebrated in 1987. As the article observes, if Ramanujan had accom- 
plished so much with paper, pencil and slateboard, how much further could he have 
seen if he had access to a modern symbolic computing environment such as Maple 
or Mathematica? 

Interestingly, although not as asymptotically fast as the quadratically conver- 
gent formulas, Ramanujan discovered some very interesting formulas, e.g. 


1 v8 = (4n)!(1103 + 26390n) 
mn 9801 (n!)43964”" ; 


n=0 


a variation of which was used by David and Gregory Chudnovsky to compute up 
to two billion digits (when accompanied with some very clever reorganization of 
the computation) in the early 1990s. Each term of this formula adds roughly eight 
correct digits. 


Keywords: Algorithms, Computation, Elliptic Integrals, History, Modular Equa- 
tions 
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and 
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Preface. The year 1987 was the centenary of Ramanujan’s birth. He died in 1920 
Had he not died so young, his presence in modern mathematics might be more 
immediately felt. Had he lived to have access to powerful algebraic manipulation 
software, such as MACSYMA, who knows how much more spectacular his already 
astonishing career might have been. 

This article will follow up one small thread of Ramanujan’s work which has 
found a modern computational context, namely, one of his approaches to approxi- 
mating pi. Our experience has been that as we have come to understand these pieces 
of Ramanujan’s work, as they have become mathematically demystified, and as we 
have come to realize the intrinsic complexity of these results, we have come to 
realize how truly singular his abilities were. This article attempts to present a 
considerable amount of material and, of necessity, little is presented in detail. We 
have, however, given much more detail than Ramanujan provided. Our intention is 
that the circle of ideas will become apparent and that the finer points may be 
pursued through the indicated references. 


1. Introduction. There is a close and beautiful connection between the transfor- 
mation theory for elliptic integrals and the very rapid approximation of pi. This 
connection was first made explicit by Ramanujan in his 1914 paper “Modular 
Equations and Approximations to 7” [26]. We might emphasize that Algorithms 1 
and 2: are not to be found in Ramanujan’s work, indeed no recursive approximation 
of m is considered, but as we shall see they are intimately related to his analysis. 
Three central examples are: 


Sum 1. (Ramanujan) 


1 y8 2 (4n)! [1103 + 263907] 


a 9801, (n!)* 306% 
Algorithm 1. Let a) := 6 — 4y2 and y) = y2 — 1. 
Let 
eee 6 ee 
Tt 7 
ail begat 
and 


On+1 = (1 a Yaar) Oy = IPOs (l + Vat. oe es 


201 
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Then 
0 <a, —1/7 < 16+ 4%?" 
and a, converges to 1/m quartically (that is, with order four). 


Algorithm 2. Let sy := 5(¥5 — 2) and ay *= 1/2. 


Let 
25 
Sn. 0= 
oh (ast gia eye 
where 
x=5/s,-1 yo(x-1P4+7 
and 
1 1/5 
z= Fe + yy? — «| . 
Let 
se —5 
G41 = 52a, — | + ys,(s2 — 2s, + 5) \ 
Then 


1 
0<a,—-— <16-5"e5" 
T 


and a, converges to 1/ quintically (that is, with order five). 


{March 


Each additional term in Sum 1 adds roughly eight digits, each additional iteration 
of Algorithm 1 quadruples the number of correct digits, while each additional 
iteration of Algorithm 2 quintuples the number of correct digits. Thus a mere 
thirteen iterations of Algorithm 2 provide in excess of one billion decimal digits of 
pi. In general, for us, pth-order convergence of a sequence {«,,) to a means that a,, 


tends to «@ and that 


|@n44 a| s Cla, = al? 


for some constant C > 0. Algorithm 1 is arguably the most efficient algorithm 
currently known for the extended precision calculation of pi. While the rates of 
convergence are impressive, it is the subtle and thoroughly nontransparent nature of 
these results and the beauty of the underlying mathematics that intrigue us most. 


Watson [37], commenting on certain formulae of Ramanujan, talks of 


a thrill which is indistinguishable from the thrill which I feel when I enter the 
Sagrestia Nuovo of the Capella Medici and see before me the austere beauty of 
the four statues representing “Day,” “Night,” “Evening,” and “Dawn” which 
Michelangelo has set over the tomb of Giuliano de’Medici and Lorenzo 


de’ Medici. 


Sum 1 is directly due to Ramanujan and appears in [26]. It rests on a modular 
identity of order 58 and, like much of Ramanujan’s work, appears without proof 
and with only scanty motivation. The first complete derivation we know of appears 
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in [11]. Algorithms 1 and 2 are based on modular identities of orders 4 and 5, 
respectively. The underlying quintic modular identity in Algorithm 2 (the relation 
for s,,) is also due to Ramanujan, though the first proof is due to Berndt and will 
appear in [77]. 

One intention in writing this article is to explain the genesis of Sum 1 and of 
Algorithms 1 and 2. It is not possible to give a short self-contained account without 
assuming an unusual degree of familiarity with modular function theory. Also, parts 
of the derivation involve considerable algebraic calculation and may most easily be 
done with the aid of a symbol manipulation package (MACSYMA, MAPLE, 
REDUCE, etc.). We hope however to give a taste of methods involved. The full 
details are available in [11]. 

A second intention is very briefly to describe the role of these and related 
approximations in the recent extended precision calculations of pi. In part this 
entails a short discussion of the complexity and implementation of such calcula- 
tions. This centers on a discussion of multiplication by fast Fourier transform 
methods. Of considerable related interest is the fact that these algorithms for 7 are 
provably close to the theoretical optimum. 


2. The State of Our Current Ignorance. Pi is almost certainly the most natural of 
the transcendental numbers, arising as the circumference of a circle of unit diame- 
ter. Thus, it is not surprising that its properties have been studied for some 
twenty-five hundred years. What is surprising is how little we actually know. 

We know that 7 is irrational, and have known this since Lambert’s proof of 1771 
(see [5]). We have known that 7 is transcendental since Lindemann’s proof of 1882 
[23]. We also know that 7 is not a Liouville number. Mahler proved this in 1953. An 
irrational number 8 is Liouville if, for any n, there exist integers p and g so that 


1 


a meee 
n 


q 
Liouville showed these numbers are all transcendental. In fact we know that 


Pp 1 
bs q > gee (2.1) 
for p,q integral with q sufficiently large. This irrationality estimate, due to 
Chudnovsky and Chudnovsky [16] is certainly not best possible. It is likely that 
14.65 should be replaced by 2 + « for any e > 0. Almost all transcendental numbers 
satisfy such an inequality. We know a few related results for the rate of algebraic 
approximation. The results may be pursued in [4} and [11]. 

We know that e” is transcendental. This follows by noting that e” = (—1)~‘ and 
applying the Gelfond-Schneider theorem [4]. We know that 7 + log2 + V2 log3 is 
transcendental. This result is a consequence of the work that won Baker a Fields 
Medal in 1970. And we know a few more than the first two hundred million digits 
of the decimal expansion for 7 (Kanada, see Section 3). 

The state of our ignorance is more profound. We do not know whether such basic 
constants as 7 + e, 1/e, or log a are irrational, let alone transcendental. The best 
we can say about these three particular constants is that they cannot satisfy any 
polynomial of degree eight or less with integer coefficients of average size less than 
10° {3}. This is a consequence of some recent computations employing the 


0 < 


wes 
q 
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Ferguson-Forcade algorithm [17]. We don’t know anything of consequence about 
the single continued fraction of pi, except (numerically) the first 17 million terms, 
which Gosper computed in 1985 using Sum 1. Likewise, apart from listing the first 
many millions of digits of 7, we know virtually nothing about the decimal expan- 
sion of w. It is possible, albeit not a good bet, that all but finitely many of the 
decimal digits of pi are in fact 0’s and 1’s, Carl Sagan’s recent novel Contact rests on 
a similar possibility. Questions concerning the normality of or the distribution of 
digits of particular transcendentals such as 7 appear completely beyond the scope 
of current mathematical techniques. The evidence from analysis of the first thirty 
million digits is that they are very uniformly distributed [2]. The next one hundred 
and seventy million digits apparently contain no surprises. 

In part we perhaps settle for computing digits of + because there is little else we 
can currently do. We would be amiss, however, if we did not emphasize that the 
extended precision calculation of pi has substantial application as a test of the 
“global integrity” of a supercomputer. The extended precision calculations de- 
scribed in Section 3 uncovered hardware errors which had to be corrected before 
those calculations could be successfully run. Such calculations, implemented as in 
Section 4, are apparently now used routinely to check supercomputers before they 
leave the factory. A large-scale calculation of pi is entirely unforgiving; it soaks into 
all parts of the machine and a single bit awry leaves detectable consequences. 


3. Matters Computational 


1 am ashamed to tell you to how many figures I carried these calculations, having 
no other business at the time. 
lsaac Newton 


Newton’s embarrassment at having computed 15 digits, which he did using the 
arcsinlike formula 


3V3 
4 


12 5-23 28 - 2? 72-2? 
3y3 
a> + 24 [Vx = x? dx, 

8) 


is indicative both of the spirit in which people calculate digits and the fact that a 
surprising number of people have succumbed to the temptation [5]. 

The history of efforts to determine an accurate value for the constant we now 
know as 7 is almost as long as the history of civilization itself. By 2000 B.c. both the 
Babylonians and the Egyptians knew 7 to nearly two decimal places. The Babyloni- 
ans used, among others, the value 3 1/8 and the Egyptians used 3 13/81. Not all 
ancient societies were as accurate, however—nearly 1500 years later the Hebrews 
were perhaps still content to use the value 3, as the following quote suggests. 


q 
I 
i) 
2 
| 


Also, he made a molten sea of ten cubits from brim to brim, round in compass, 
and five cubits the height thereof; and a line of thirty cubits did compass it round 
about. 

Old Testament, 1 Kings 7:23 


Despite the long pedigree of the problem, all nonempirical calculations have 
employed, up to minor variations, only three techniques. 
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i) The first technique due to Archimedes of Syracuse (287-212 8.c.) is, recur- 
sively, to calculate the length of circumscribed and inscribed regular 6 - 2”-gons 
about a circle of diameter 1. Call these quantities a, and b,, respectively. Then 


ro 


ay = 2¥3, b, == 3 and, as Gauss’s teacher Pfaff discovered in 1800, 
2a,,b, 


-— and 5b,,, = ja,,,5,. 
a, + b, met y4@n+19n 


Antt = 


Archimedes, with n = 4, obtained 
30 <a < 3h. 


While hardly better than estimates one could get with a ruler, this is the first method 
that can be used to generate an arbitrary number of digits, and to a nonnumerical 
mathematician perhaps the problem ends here. Variations on this theme provided 
the basis for virtually ali calculations of 7 for the next 1800 years, culminating with 
a 34 digit calculation due to Ludolph van Ceulen (1540-1610). This demands 
polygons with about 2° sides and so is extraordinarily time consuming. 


ii) Calculus provides the basis for the second technique. The underlying method 
relies on Gregory’s series of 1671 
x dt x? ‘ a ; 
t oo ao Be eee et Sts < 
arctan x f Ta x 3 5 |x| 


coupled with a formula which allows small x to be used, like 


7 1 1 
Fi =4 artan| 5 - aretan| 5 . 


This particular formula is due to Machin and was employed by him to compute 100 
digits of 7 in 1706. Variations on this second theme are the basis of all the 
calculations done until the 1970’s including William Shanks’ monumental hand- 
calculation of 527 digits. In the introduction to his book [32], which presents this 
calculation, Shanks writes: 


Towards the close of the year 1850 the Author first formed the design of rectifying 
the circle to upwards of 300 places of decimals. He was fully aware at that time, 
that the accomplishment of his purpose would add little or nothing to his fame as a 
Mathematician though it might as a Computer; nor would it be productive of 
anything in the shape of pecuniary recompense. 


Shanks actually attempted to hand-calculate 707 digits but a mistake crept in at 
the 527th digit. This went unnoticed until 1945, when D. Ferguson, in one of the 
last “nondigital” calculations, computed 530 digits. Even with machine calculations 
mistakes occur, so most record-setting calculations are done twice—by sufficiently 
different methods. 

The advent of computers has greatly increased the scope and decreased the toil of 
such calculations. Metropolis, Reitwieser, and von Neumann computed and ana- 
lyzed 2037 digits using Machin’s formula on ENIAC in 1949. In 1961, Dan Shanks 
and Wrench calculated 100,000 digits on an IBM 7090 [31]. By 1973, still using 
Machin-like arctan expansions, the million digit mark was passed by Guillard and 
Bouyer on a CDC 7600. 
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iii) The third technique, based on the transformation theory of elliptic integrals, 
provides the algorithms for the most recent set of computations. The most recent 
records are due separately to Gosper, Bailey, and Kanada. Gosper in 1985 calcu- 
lated over 17 million digits (in fact over 17 million terms of the continued fraction) 
using a carefully orchestrated evaluation of Sum 1. 

Bailey in January 1986 computed over 29 million digits using Algorithm 1 on a 
Cray 2 [2]. Kanada, using a related quadratic algorithm (due in basis to Gauss and 
made explicit by Brent [12] and Salamin [27]) and using Algorithm 1 for a check, 
verified 33,554,000 digits. This employed a HITACHI S-810/20, took roughly eight 
hours, and was completed in September of 1986. In January 1987 Kanada extended 
his computation to 2°? decimal places of 7 and the hundred million digit mark had 
been passed. The calculation took roughly a day and a half on a NEC SX2 machine. 
Kanada’s most recent feat (Jan. 1988) was to compute 201,326,000 digits, which 
required only six hours on a new Hitachi S-820 supercomputer. Within the next few 
years many hundreds of millions of digits will no doubt have been similarly 
computed. Further discussion of the history of the computation of pi may be found 
in [5] and [9]. 


4. Complexity Concerns. One of the interesting morals from theoretical com- 
puter science is that many familiar algorithms are far from optimal. In order to be 
more precise we introduce the notion of bit complexity. Bit complexity counts the 
number of single operations required to complete an algorithm. The single-digit 
operations we count are +,—,X. (We could, if we wished, introduce storage and 
logical comparison into the count. This, however, doesn’t affect the order of growth 
of the algorithms in which we are interested.) This is a good measure of time on a 
serial machine. Thus, addition of two n-digit integers by the usual method has bit 
complexity O(7), and straightforward uniqueness considerations show this to be 
asymptotically best possible. 

Multiplication is a different story. Usual multiplication of two n-digit integers 
has bit complexity O(n”) and no better. However, it is possible to multiply two 
n-digit integers with complexity O(n(logn)(loglogz)). This result is due to 
Schénhage and Strassen and dates from 1971 [29]. It provides the best bound known 
for multiplication. No multiplication can have speed better than O(n). Unhappily, 
more exact results aren’t available. 

The original observation that a faster than O(n”) multiplication is possible was 
due to Karatsuba in 1962. Observe that 


(a + b10"){c + d10") = ac + [(a — b)(c — d) ~ ac — bd] 10" + bd10*", 


and thus multiplication of two 2n-digit integers can be reduced to three multiplica- 
tions of n-digit integers and a few extra additions. (Of course multiplication by 10” 
is just a shift of the decimal point.) If one now proceeds recursively one produces a 
multiplication with bit complexity 
oO n log 3) ; 

Note that log, 3 = 1.58... < 2. 

We denote by M(n) the bit complexity of multiplying two n-digit integers 
together by any method that is at least as fast as usual multiplication. 


The trick to implementing high precision arithmetic is to get the multiplication 
right. Division and root extraction piggyback off multiplication using Newton’s 
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method. One may use the iteration 
X yay = 2X XEY 


to compute 1/y and the iteration 


1 y 
as mae i, on 
k 


to compute vy . One may also compute 1/ vy from 


x, (3 — yx?) 
Ce a 


and so avoid divisions in the computation of vy . Not only do these iterations 
converge quadratically but, because Newton’s method is self-correcting (a slight 
perturbation in x, does not change the limit), it is possible at the Ath stage to work 
only to precision 2*. If division and root extraction are so implemented, they both 
have bit complexity O(M(n)), in the sense that n-digit input produces n-digit 
accuracy in a time bounded by a constant times the speed of multiplication. This 
extends in the obvious way to the solution of any algebraic equation, with the 
startling conclusion that every algebraic number can be computed (to_n-digit 
accuracy) with bit complexity O(M(n)). Writing down n-digits of 2 or 3V7 is (up 
to a constant) no more complicated than multiplication. 

The Schénhage-Strassen multiplication is hard to implement. However, a multi- 
plication with complexity O(log n)**'n) based on an ordinary complex (floating 
point) fast Fourier transform is reasonably straightforward. This is Kanada’s 
approach, and the recent records all rely critically on some variations of this 
technique. 

To see how the fast Fourier transform may be used to accelerate multiplication, 
let x = (Xo, Xy, Xp,---> X__1) aNd y *= (N%, Vs Yos-+-s Vp —1) be the representations 
of two high-precision numbers in some radix b. The radix b is usually selected to be 
some power of 2 or 10 whose square is less than the largest integer exactly 
representable as an ordinary floating-point number on the computer being used. 
Then, except for releasing each “carry,” the product z ‘= (Zo, 2), 22,---; Z2,~4) Of x 
and y may be written as 


Z9 = XoNo 
21 = XoV + XM 
24 = Xo V2 + XM, + X2Mo 


n-1 = XoVn—1 + Xn—2 Fo Xo 


Z2n-3 Xn—1¥n-2 oh Xn—-2)n—1 
Zon—-2 = Xn—-1Vn-1 
Zo,~-1 = 0. 


Now consider x and y to have n zeros appended, so that x, y, and z all have 
length N = 2n. Then a key observation may be made: the product sequence z is 
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precisely the discrete convolution C(x, y): 

N-1 

2. = C(x, y) = Le xs 

j=0 

where the subscript & — j is to be interpreted as k —j + N if k —j is negative. 
Now a well-known result of Fourier analysis may be applied. Let F(x) denote 

the discrete Fourier transform of the sequence x, and let F~'(x) denote the inverse 
discrete Fourier transform of x: 


N-1 
F,(x) = y aera 
j=0 
1 N-i vf 
F(x) ss, i x Lier ore™, 
f=0 


Then the “convolution theorem,” whose proof is a straightforward exercise, states 
that 


F[C(x, y)] = F(x) F(y) 
or, expressed another way, 


C(x, y) = FL F(x) F(y)]. 

Thus the entire multiplication pyramid z can be obtained by performing two 
forward discrete Fourier transforms, one vector complex multiplication and one 
inverse transform, each of length N = 2n. Once the real parts of the resulting 
complex numbers have been rounded to the nearest integer, the final multiprecision 
product may be obtained by merely releasing the carries modulo b. This may be 
done by starting at the end of the z vector and working backward, as in elementary 
school arithmetic, or by applying other schemes suitable for vector processing on 
more sophisticated computers. 

A straightforward implementation of the above procedure would not result in 
any computational savings—in fact, it would be several times more costly than the 
usual “schoolboy” scheme. The reason this scheme is used is that the discrete 
Fourier transform may be computed much more rapidly using some variation of the 
well-known “fast Fourier transform” (FFT) algorithm [13]. In particular, if N = 2”, 
then the discrete Fourier transform may be evaluated in only 5m2” arithmetic 
operations using an FFT. Direct application of the definition of the discrete Fourier 
transform would require 2?”*? floating-point arithmetic operations, even if it is 
assumed that all powers of e~*"‘/" have been precalculated. 

This is the basic scheme for high-speed multiprecision multiplication. Many 
details of efficient implementations have been omitted. For example, it is possible to 
take advantage of the fact that the input sequences x and y and the output 
sequence z are all purely real numbers, and thereby sharply reduce the operation 
count. Also, it is possible to dispense with complex numbers altogether in favor of 
performing computations in fields of integers modulo large prime numbers. Inter- 
ested readers are referred to [2], [8], [13], and [22]. 

When the costs of all the constituent operations, using the best known tech- 
niques, are totalled both Algorithms 1 and 2 compute x digits of 7 with bit 
complexity O( M(n)log 2), and use O(log n) full precision operations. 
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The bit complexity for Sum 1, or for 7 using any of the arctan expansions, is 
between O((log n)*M(n)) and O(nM(n)) depending on implementation. In each 
case, one is required to sum O(7) terms of the appropriate series. Done naively, one 
obtains the latter bound. If the calculation is carefully orchestrated so that the terms 
are grouped to grow evenly in size (as rational numbers) then one can achieve the 
former bound, but with no corresponding reduction in the number of operations. 

The Archimedean iteration of section 2 converges like 1/4” so in excess of n 
iterations are needed for n-digit accuracy, and the bit complexity is O(nM(n)). 

Almost any familiar transcendental number such as e, y, ¢(3), or Catalan’s 
constant (presuming the last three to be nonalgebraic) can be computed with bit 
complexity O((log 2)M(n)) or O(log n)?M(n)). None of these numbers is known 
to be computable essentially any faster than this. In light of the previous observa- 
tion that algebraic numbers are all computable with bit complexity O(M(n)), a 
proof that 7 cannot be computed with this speed would imply the transcendence of 
a. It would, in fact, imply more, as there are transcendental numbers which have 
complexity O( M(n)). An example is 0.10100100001... . 

It is also reasonable to speculate that computing the nth digit of w is not very 
much easier than computing all the first digits. We think it very probable that 
computing the nth digit of 7 cannot be O(7). 


5. The Miracle of Theta Functions 


When I was a student, abelian functions were, as an effect of the Jacobian 
tradition, considered the uncontested summit of mathematics, and each of us was 
ambitious to make progress in this field. And now? The younger generation 
hardly knows abelian functions. 

Felix Klein [21] 


Felix Klein’s lament from a hundred years ago has an uncomfortable timelessness 
to it. Sadly, it is now possible never to see what Bochner referred to as “the miracle 
of the theta functions” in an entire university mathematics program. A small piece 
of this miracle is required here [6], [11], [28]. First some standard notations. The 
complete elliptic integrals of the first and second kind, respectively, 


K(k) = oo (5.1) 
0 yl — k?sin’s 


and 
E(k) = pec — ksin’t dt. (5.2) 
0 


The second integral arises in the rectification of the ellipse, hence the name elliptic 
integrals, The complementary modulus is 


kiss v1 _ k? 
and the complementary integrals K’ and E’ are defined by 
K(k) = K(k’) and E(k) = E(k’). 


The first remarkable identity is Legendre’s relation namely 


E(k)K'(k) + E'(k) K(k) ~ K(k) K(k) = ; (5.3) 
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(for 0 < k < 1), which is pivotal in relating these quantities to pi. We also need to 
define two Jacobian theta functions 


@(g)= Yo gary (5.4) 
and 
@(q)= Yq”. (5.5) 
n=—~0 


These are in fact specializations with (¢ = 0) of the general theta functions. More 
generally 


foo} 
@,(t,.q)= Sgt — (imr> 0) 
with similar extensions of @,. In Jacobi’s approach these general theta functions 
provide the basic building blocks for elliptic functions, as functions of ¢ (see [11], 
(39}). 
The complete elliptic integrals and the special theta functions are related as 
follows. For {g| < 1 


K(k) = 534) (5.6) 
and 
nae dK(k) 
E(k) =(kK’Y| K(k) +k am (5.7) 
where 
03 @2(- 
kaka) =p. Keg) = Se? (5.8) 
and 
gq =e TR R (5.9) 
The modular function d is defined by 
@,(q) | 
A(t) = A(q) = kg) = Eel (5.10) 


where 
qi= eitt 
We wish to make a few comments about modular functions in general before 
restricting our attention to the particular modular function A. Modular functions are 
functions which are meromorphic in H, the upper half of the complex plane, and 
which are invariant under a group of linear fractional transformations, G, in the 
sense that 


f(g(z))=f(z) We eG. 
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{Additional growth conditions on f at certain points of the associated fundamental 
region (see below) are also demanded.] We restrict G to be a subgroup of the 
modular group T where T is the set of all transformations w of the form 


at+b 
whe) ct+d’ 
with a, b, c,d integers and ud — be = 1. Observe that T is a group under composi- 
tion. A fundamental region F, is a set in H with the property that any element in H 
is uniquely the image of some element in F, under the action of G. Thus the 
behaviour of a modular function is uniquely determined by its behaviour on a 
fundamental region. 

Modular functions are, in a sense, an extension of elliptic (or doubly periodic) 
functions~-functions such as sm which are invariant under linear transformations 
and which arise naturally in the inversion of elliptic integrals. 

The definitions we have given above are not complete. We will be more precise in 
our discussion of A. One might bear in mind that much of the theory for A holds in 
considerably greater generality. 

The fundamental region F we associate with A is the set of complex numbers 


F:= {imt>0} 9 [{jree]<1 and 
2¢+ 1)>1} U {rer = -1} U (Pr + 1 =13]. 
The A-group (or theta-subgroup) is the set of linear fractional transformations w 
satisfying 
at +b 
td’ 


w(t) = 


where a, b,c, d are integers and ad — be = 1, while in addition a and d are odd 
and b and e¢ are even. Thus the corresponding matrices are unimodular. What 
makes A a A-modular function is the fact that A is meromorphic in {im ¢ > 0} and 
that 


A(w(t)) = A(2) 


for all w in the A-group, plus the fact that A tends to a definite limit (possibly 
infinite) as ¢ tends to a vertex of the fundamental region (one of the three points 
(0, — 1), (0, 0), (i, 00)). Here we only allow convergence from within the fundamental 
region. 

Now some of the miracle of modular functions can be described. Largely because 
every point in the upper half plane is the image of a point in F under an element of 
the A-group, one can deduce that any A-modular function that is bounded on F is 
constant. Slightly further into the theory, but relying on the above, is the result that 
any two modular functions are algebraically related, and resting on this, but further 
again into the field, is the following remarkable result. Recall that g is given by 
(5.9). 


THEOREM 1. Let z be a primitive pth root of unity for p an odd prime. Consider the 
pth order modular equation for \ as defined by 


W, (x, X) = (x — Ag)(x — Ay) +++ (x -A,), (5.11) 
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where 
Medel") i <p 
and 


A, = A(q?). 


Then the function W, is a polynomial in x and X (independent of q), which has integer 
coefficients and is of degree p + 1 in both x and X. 


The modular equation for A usually has a simpler form in the associated 
variables u = x1/8 and v:= X'/8, In this form the Sth-order modular equation is 
given by 


Q5(u,v) = uS — vo + 5u7v?(u? — v2) + 4un(1 — u4o*). (5.12) 
In particular 


8(q?) _ of: (4) _ e 
®;(q”) ®,(q) 


are related by an algebraic equation of degree p + 1. 
The miracle is not over. The pth-order multiplier (for \) is defined by 


K(k(q?)) ee | 


M,(k(q),k(q?)) = =.) (5.13) 


K(k(q)) 
and turns out to be a rational function of k(q?) and k(q). 
One is now in possession of a pth-order algorithm for K/z, namely: Let 
k, = k(q?’). Then 


2K(ko) _ 


T 


My'(ko, k,)M, (ki; ky)M,*(ky, ky) +>. 


This is an entirely algebraic algorithm. One needs to know the pth-order modular 
equation for A to compute k;,, from k; and one needs to know the rational 
multiplier M,. The speed of convergence (O(c?’), for some ¢ < 1) is easily deduced 
from (5.13) and (5.9). 

The function A(t) is 1-1 on F and has a well-defined inverse, A~1, with branch 
points only at 0, 1 and oo. This can be used to provide a one line proof of the “big” 
Picard theorem that a nonconstant entire function misses at most one value (as does 
exp). Indeed, suppose g is an entire function and that it is never zero or one; then 
exp(A~1(g(z))) is a bounded entire function and is hence constant. 

Littlewood suggested that, at the right point in history, the above would have 
been a strong candidate for a ‘one line doctoral thesis’. 


6. Ramanujan’s Solvable Modular Equations. Hardy [19] commenting on 
Ramanujan’s work on elliptic and modular functions says 


It is here that both the profundity and limitations of Ramanujan’s knowledge 
stand out most sharply. 


We present only one of Ramanujan’s modular equations. 
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THEOREM 2. 
50 25 
Te) ai tag (6 
@, ( q) 


where for i = 1 and 2 


nm de(y Wa) 
with 


_ 58;(q°) 
ee "0,(4) 


This is a slightly rewritten form of entry 12(iii) of Chapter 19 of Ramanujan’s 
Second Notebook (see [7], where Berndt’s proofs may be studied). One can think of 
Ramanujan’s quintic modular equation as an equation in the multiplier M, of 
(5.13). The initial surprise is that it is solvable. The quintic modular relation for A, 
W,, and the related equation for 1/8, Q., are both nonsolvable. The Galois group of 
the sixth-degree equation 2, (see (5.12)) over Q(v) is A, and is nonsolvable. Indeed 
both Hermite and Kronecker showed, in the middle of the last century, that the 
solution of a general quintic may be effected in terms of the solution of the 
5th-order modular equation (5.12) and the roots may thus be given in terms of 
the theta functions. 

In fact, in general, the Galois group for W, of (5.11) has order p(p + 1)(p — 1) 
and is never solvable for p > 5. The group is quite easy to compute, it is generated 
by two permutations. If 


—1 and y:=(x-—1) +7. 


ent 


qe, then r>7+2 and OF naa) 


are both elements of the A-group and induce permutations on the A, of Theorem 1. 
For any fixed p, one can use the g-expansion of (5.10) to compute the effect of 
these transformations on the A,, and can thus easily write down the Galois group. 

While W, is not solvable over Q(A), it is solvable over Q(A, Ay). Note that A, is 
a root of W,. It is of degree p + 1 because W, is irreducible. Thus the Galois group 
for W, over Q(X, Aq) has order p( p — 1). For p = 5, 7, and 11 this gives groups of 
order 20, 42, and 110, respectively, which are obviously solvable and, in fact, for 
general primes, the construction always produces a solvable group. 

From (5.8) and (5.10) one sees that Ramanujan’s modular equation can be 
rewritten to give A, solvable in terms of A, and A. Thus, we can hope to find an 
explicit solvable relation for A, in terms of A and Aj. For p = 3, W, is of degree 4 
and is, of course, solvable. For p = 7, Ramanujan again helps us out, by providing a 
solvable seventh-order modular identity for the closely related eta function defined 
by 


loa] 
n(q) = 4" TT (1-9). 
n= 
The first interesting prime for which an explicit solvable form is not known is the 


“endecadic” (p = 11) case. We consider only prime values because for nonprime 
values the modular equation factors. 
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This leads to the interesting problem of mechanically constructing these equa- 
tions. In principle, and to some extent in practice, this is a purely computational 
problem. Modular equations can be computed fairly easily from (5.11) and even 
more easily in the associated variables u and v. Because one knows a priori bounds 
on the size of the (integer) coefficients of the equations one can perform these 
calculations exactly. The coefficients of the equation, in the variables u and v, grow 
at most like 2”. (See [11].) Computing the solvable forms and the associated 
computational problems are a little more intricate— though still in principle entirely 
mechanical. A word of caution however: in the variables u and v the endecadic 
modular equation has largest coefficient 165, a three digit integer. The endecadic 
modular equation for the intimately related function J (Klein’s absolute invariant) 
has coefficients as large as 


27090964785531389931563200281035226311929052227303 x 2973195711753. 


It is, therefore, one thing to solve these equations, it is entirely another matter to 
present them with the economy of Ramanujan. 

The paucity of Ramanujan’s background in complex analysis and group theory 
leaves open to speculation Ramanujan’s methods. The proofs given by Berndt are 
difficult. In the seventh-order case, Berndt was aided by MACSYMA-~a sophisti- 
cated algebraic manipulation package. Berndt comments after giving the proof of 
various seventh-order modular identities: 


Of course, the proof that we have given is quite unsatisfactory because it is a 
verification that could not have been achieved without knowledge of the result. 
Ramanujan obviously possessed a more natural, transparent, and ingenious 


proof. 


7. Modular Equations and Pi. We wish to connect the modular equations of 
Theorem 1 to pi. This we contrive via the function alpha defined by: 


iis E(k) tT 
a(r ioe! K(k) = (2K(k))?’ 


(7.1) 


where 
k:=k(q) and gw=e™, 
This allows one to rewrite Legendre’s equation (5.3) in a one-sided form without the 
conjugate variable as 
ca 
47 KLE - (vr - afr) x]. (7.2) 


We have suppressed, and will continue to suppress, the k variable. With (5.6) and 
(5.7) at hand we can write a q-expansion for a, namely, 


Ss (7.3) 
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and we can see that as r tends to infinity g = e~" tends to zero and a(r) tends to 
1/7. In fact 


a(r) — ~ sr - = levi (7.4) 


The key now is iteratively to calculate a. This is the content of the next theorem. 


THEOREM 3. Let ky ‘= k(q), ky = k(q?) and M, = M,(ko, ky) as in (5.13). 
Then 


a( pr) = 


where represents the full derivative of M,, with respect to ko. In particular, « is 
algebraic for rational arguments. 


We know that K(k,) is related via M, to K(k) and we know that E(k) is 
related via differentiation to K. (See (5.7) and (5.13).) Note that g — q? corre- 
sponds to r ~ p?r. Thus from (7.2) some relation like that of the above theorem 
must exist. The actual derivation requires some careful algebraic manipulation. (See 
{11}, where it has also been made entirely explicit for p := 2, 3, 4, 5, and 7, and 
where numerous algebraic values are determined for a(r).) In the case p ‘= 5 we 
can specialize with some considerable knowledge of quintic modular equations to 
get: 


THEOREM 4. Let s = 1/M.(Kpo, k,). Then 


a(25r) = sta(r) - 6] 9) 4 (e738 FH | 


This couples with Ramanujan’s quintic modular equation to provide a derivation of 
Algorithm 2. 

Algorithm 1 results from specializing Theorem 3 with p = 4 and coupling it with 
a quartic modular equation. The quartic equation in question is just two steps of the 
corresponding quadratic equation which is Legendre’s form of the arithmetic 
geometric mean iteration, namely: 


2Vk 
ae ce a 


An algebraic pth-order algorithm for 7 is derived from coupling Theorem 3 with 
a pth-order modular equation. The substantial details which are skirted here are 
available in [11]. 


ky 


8. Ramanujan’s sum. This amazing sum, 
1 v8 & (4n)! [1103 + 26390n] 


7 9801, (n!)4 3964" 


is a specialization (N = 58) of the following result, which gives reciprocal series for 
vt in terms of our function alpha and related modular quantities. 
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THEOREM 5. 
1 1 3 Pye 
1_¢ tal ta) (@}.0 aa 
= (nt)? = 
where, 
ts -1 
; 4k y (kK) : gn + Bx? 
Xn = Fie aoe = age n 
(1 + k3) 
with 
a(N)xy'  ¥N ay By 
d,(N) = | ——— - —gy*| + nN {| 
AN ae eee (NG 
and 


ky ke), gh = (ky) /(2ky). 
Here (c),, is the rising factorial: (c), = c(¢ + Ile + 2)-+-(e +n — 2). 


Some of the ingredients for the proof of Theorem 5, which are detailed in [11], 
are the following. Our first step is to write (7.2) as a sum after replacing the E by K 
and dK /dk using (5.7). One then uses an identity of Clausen’s which allows one to 
write the square of a hypergeometric function ,F, in terms of a generalized 
hypergeometric ,F,, namely, for all k one has 


ere 2K(k)/? pith haa 2 : 
+ = —,-,5 ‘| soa 
( — pu Gar aap as Ba eo 


« GLELG) leeeal 


n=0 (1),@),; n} 
Here g is related to k by 
4k(k’y i te 
(1 +k?) 2 


as required in Theorem 5. We have actually done more than just use Clausen’s 
identity, we have also transformed it once using a standard hypergeometric substitu- 
tion due to Kummer. Incidentally, Clausen was a nineteenth-century mathematician 
who, among other things, computed 250 digits of 7 in 1847 using Machin’s formula. 
The desired formula (8.1) is obtained on combining these pieces. 

Even with Theorem 5, our work is not complete. We still have to compute 


keg =k (e758) and atsg = (58). 
In fact 


853 = 5 
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is a well-known invariant related to the fundamental solution to Pell’s equation for 
29 and it turns out that 


y29 +5)" 
O53 = () (99729 — 444)(9972 — 70 — 13729). 
One can, in principle, and for N := 58, probably in practice, solve for ky by directly 
solving the Nth-order equation 


Wy (kx 1 = ki) =O: 


For N = 58, given that Ramanujan [26} and Weber [38] have calculated g., for us, 
verification by this method is somewhat easier though it still requires a tractable 
form of W,,. Actually, more sophisticated number-theoretic techniques exist for 
computing ky (these numbers are called singular moduli). A description of such 
techniques, including a reconstruction of how Ramanujan might have computed the 
various singular moduli he presents in [26]; is presented by Watson in a long series 
of papers commencing with [36]; and some more recent derivations are given in [11] 
and [30]. An inspection of Theorem 5 shows that all the constants in Series 1 are 
determined from g.g. Knowing a is equivalent to determining that the number 1103 
is correct. 

It is less clear how one explicitly calculates a.. in algebraic form, except by brute 
force, and a considerable amount of brute force is required; but a numerical 
calculation to any reasonable accuracy is easily obtained from (7.3) and 1103 
appears! The reader is encouraged to try this to, say, 16 digits. This presumably is 
what Ramanujan observed. Ironically, when Gosper computed 17 million digits of 7 
using Sum 1, he had no mathematical proof that Sum 1 actually converged to 1/7. 
He compared ten million digits of the calculation to a previous calculation of 
Kanada et al. This verification that Sum 1 is correct to ten million places also 
provided the first complete proof that asg is as advertised above. A nice touch—that 
the calculation of the sum should prove itself as it goes. 

Roughly this works as follows. One knows enough about the exact algebraic 
nature of the components of d,(N) and x, to know that if the purported sum (of 
positive terms) were incorrect, that before one reached 3 million digits, this sum 
must have ceased to agree with 1/7. Notice that the components of Sum 1 are 
related to the solution of an equation of degree 58, but virtually no irrationality 
remains in the final packaging. Once again, there are very good number-theoretic 
reasons, presumably unknown to Ramanujan, why this must be so (58 is at least a 
good candidate number for such a reduction). Ramanujan’s insight into this 
marvellous simplification remains obscure. 

Ramanujan [26] gives 14 other series for 1/7, some others almost as spectacular 
as Sum 1—and one can indeed derive some even more spectacular related series.* 
He gives almost no explanation as to their genesis, saying only that there are 
“corresponding theories” to the standard theory (as sketched in section 5) from 
which they follow. Hardy, quoting Mordell, observed that “it is unfortunate that 
Ramanujan has not developed the corresponding theories.” By methods analogous 


*(Added in proof) Many related series due to Borwein and Borwein and to Chudnovsky and 
Chudnovsky appear in papers in Ramanujan Revisited, Academic Press, 1988. 
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to those used above, all his series can be derived from the classical theory [11]}. 
Again it is unclear what passage Ramanujan took to them, but it must in some part 
have diverged from ours. 

We conclude by writing down another extraordinary series of Ramanujan’s, 
which also derives from the same general body of theory, 


342n +5 


1 00 
ae (2%) 
n giant4 


This series is composed of fractions whose numerators grow like 2° and whose 
denominators are exactly 16 - 2'?". In particular this can be used to calculate the 
second block of n binary digits of 7 without calculating the first » binary digits. 
This beautiful observation, due to Holloway, results, disappointingly, in no intrinsic 
reduction in complexity. 


9, Sources. References [7], [11], [19], [26], [36], and [37] relate directly to Ra- 
manujan’s work. References [2], [8], [9], [10], [12], [22], [24], [27], [29], and [31] 
discuss the computational concerns of the paper. 

Material on modular functions and special functions may be pursued in [1], [6], 
[9], [14], {15}, [18], [20], [28], [34], [38], and [39]. Some of the number-theoretic 
concerns are touched on in [3}, [6], [9], [11], [16], [23], and [35}. 

Finally, details of all derivations are given in [11]. 
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Synopsis: 

In this paper, Jonathan Borwein, Peter Borwein and Karl Dilcher described 
a remarkable phenomenon originally brought to their attention by R. D. North of 
Colorado Springs, Colorado. North found that if one uses Gregory’s series for 7, 
namely 7 = 407° ,(—1)*~1/(2k—1), to compute decimal digits of 7, but truncates 
the sum to 500,000 terms, one finds, not surprisingly, that the sum is incorrect in 
the sixth digit. But surprisingly, the next ten digits are correct, and a similar 
on-again, off-again patterns continues much further. 

Borwein, Borwein and Dilcher were above to explain this curious phenomenon 
— first by looking up the sequence of discrepancy values in Sloane’s Handbook of 
Integer Sequences (this was before the online version was available), identifying 
these numbers as Euler numbers, applying the Boole summation formula, and ex- 
tending the result to some other series. The resulting study not only explains a 
very interesting feature of the decimal expansion of 7, but it is also a classic of 
experimental mathematics in action. 


Keywords: Curiosities, Series 


© Springer International Publishing Switzerland 2016 197 
D.H. Bailey, JM. Borwein, Pi: The Next Generation, 
DOI 10.1007/978-3-319-32377-0_12 


12. PI, EULER NUMBERS, AND ASYMPTOTIC EXPANSIONS (1989) 199 


Pi, Euler Numbers, and Asymptotic Expansions 


J. M. BorweIn, P. B. BORWEIN, and K. Ditcuer!, Dathousie University, Halifax, Canada 


JONATHAN M. BorwEIN was an Ontario Rhodes Scholar (1971) at Jesus 
College, Oxford, where he completed a D. Phil. (1974) with Michael Demp- 
ster. Since 1974 he has worked at Dalhousie University where he is professor 
of mathematics. He has also been on faculty at Carnegie-Mellon University 
(1980-82). He was the 1987 Coxeter-James lecturer of the Canadian Mathe- 
matical Society and was awarded the Atlantic Provinces Council on the 
Sciences 1988 Gold Medal for Research. His research interests include 
functional analysis, classical analysis, and optimization theory. 


PETER B. BORWEIN obtained a Ph.D. (1979) from the University of 
British Columbia, under the supervision of David Boyd. He spent 1979-80 as 
a NATO research fellow in Oxford. Since then he has been on faculty at 
Dalhousie (except for a sabbatical year at the University of Toronto) and is 
now Associate Professor of Mathematics. His research interests include 
approximation theory, classical analysis, and complexity theory. 


Kari DILCHER received his undergraduate education and a Dipl. Math. 
degree at Technische Universitat Clausthal in West Germany. He completed 
his Ph.D (1983) at Queen’s University in Kingston, Ontario with Paulo 
Ribenboim. Since 1984 he has been teaching at Dalhousie, where he is now 
Assistant Professor. His research interests include Bernoulli numbers and 
polynomials, and classical complex analysis. 


1. Introduction. Gregory’s series for 7, truncated at 500,000 terms, gives to forty 
places 


500,000 (—4)*-? 


aD Tk-1. 3.141590653589793240462643383269502884197. 
k=1 


The number on the right is not 7 to forty places. As one would expect, the 6th 
digit after the decimal point is wrong. The surprise is that the next 10 digits are 
correct. In fact, only the 4 underlined digits aren’t correct. This intriguing observa- 
tion was sent to us by R. D. North [10] of Colorado Springs with a request for an 
explanation. The point of this article is to provide that explanation. Two related 
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examples, to fifty digits, are 
50,000 ( — 1 aa 


T 
2 payee 
= 1.5707863267948976192313211916397520520985833147388 
1 -—1 5 ~ 61 
and 
50,000 (1) *+1 
ipa. 


k 
k=1 
.69313718065994530939723212147417656804830013446572, 


1 -1 2 — 16 272 


where all but the underlined digits are correct. The numbers under the underlined 
digits are the numbers that must be added to correct these. The numbers 1, —1, 5, 
—61 are the first four Euler numbers while 1, —1, 2, —16, 272 are the first five 
tangent numbers. Our process of discovery consisted of generating these sequences 
and then identifying them with the aid of Sloane’s Handbook of Integer Sequences 
{11]. What one is observing, in each case, is an asymptotic expansion of the error in 
Euler summation. The amusing detail is that the coefficients of the expansion are 
integers. All of this is explained by Theorem 1. 

The standard facts we need about the Euler numbers { E;}, the tangent numbers 
{T,}, and the Bernoulli numbers { B;}, may all be found in [1] or in [6]. The numbers 
are defined as the coefficients of the power series 


= Be ye as 
sec z = X(-1) Qnyt’ (1.1) 
0 Ty gant 
tan z = mi aE and T, = 1, (1.2) 
Zz ° Bz" 
e7—1 r nao ft! GO) 
They satisfy the relations 
(2 
» (37 |x =0,  Egny1 = 9, (1.4) 
=0 
BPS) 
B, = 2"(2"—1) n> 1, (1.5) 
and 
y ("{*)a.=0. (1.6) 


k=0 


These three identities allow for the easy generation of { F£,,}, {7,,}, and {B,}. The 
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first few values are recorded below. 


GG i cP ee 
T, 1 ~1 2 —16 272 


: -1 1 -1 1 -1 
2 6 30. 42 30° 
It is clear from (1.4) that the Euler numbers are integral. From (1.5) and (1.6) it 
follows that the tangent numbers are integers. Also, 


4"*1(2n)! 2(2n)! 
|Eon| ~ Sart and Boul ~ Gaye 


as follows from (5.1) and (5.2) below. The main content of this note is the following 
theorem. The simple proof we offer relies on the Boole Summation Formula, which 
is a pretty but less well-known analogue of Euler summation. The details are 
contained in Sections 2 and 3 (except for c] which is a straightforward application of 
Euler summation). More complicated developments can be based directly on Euler 
summation or on results in [9]. 


THEOREM 1. The following asymptotic expansions hold: 
a NA(-1)*? @  B,, 


are) 
a] 2 x Ik -1 xy Nm 
| es | 5 61 
i ING ONS 
a ee  Dom=1 
b] tog2- E -———~ + Fe 
k=l k N m=1 N? 
1 1 ‘ 2 16 2m 
~ N N2 N4 NE” NB 
and 
gm? N71 4 1 2 Bo 
‘| 6) 27 int? 2 yet 
1 1 i 1 1 


= r+ asta re ees toe 
N 2N* 6N3 30N°% 9 42N7 

From the asymptotics of { £,} and { B,} and (1.5) we see that each of the above 
infinite series is everywhere divergent; the correct interpretation of their asymptotics 
is 


: Ean Ean (2K + 1)! 
a’] Dia im FO ae 
m=1 m=1 N (aN) 
b] x Doge > ene (2K + 1)! 
mr N2™ sect N2” (mn )?*"? 


2 By Kk B. (2K +1)! 
c'] oy Nomtl XL Nomi +0 TKe1 > 
m=1 m=l (27N) 
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where in each case the constant concealed by the order symbol is independent of NV 
and K. In fact, the constant 10 works in all cases. 


2. The Boole Summation Formula. The Euler polynomials £,,(x) can be defined 
by the generating function 


2e* 00 t” 
[7 - LEO (l<9): (2.1) 
n=0 ° 


(see [1, p. 804]). Each E,(x) is a polynomial of degree n with leading coefficient 1. 
We also define the periodic Euler function £,,(x) by 


E,(x +1) = -E,(x) 
for all x, and 
E(x) =E,(x) for 0<x<1. 


It can be shown that E,(x) has continuous derivatives up to the (” — 1)st order. 
The following is known as Boole’s summation formula (see, for example, 


[9. p. 34). 


LEMMA 1. Let f(t) be a function with m continuous derivatives, defined on the 
intervalx <t <x +. Then forO <h<1 


m-1 


fx + he) =D SE (n)- (F(x + 0) +(x)) + Re 


where 


1 fo 
o 


R,= > ia a 
50 aD f(x + ot) dt 


m 


This summation formula is easy to establish by repeated integration by parts of 
the above integral. It is remarked in [9, p. 26] that this formula was known to Euler, 
for polynomial f and without the remainder term. Also note that Lemma 1 turns 
into Taylor’s formula with Lagrange’s remainder term if we replace h by h/w and 
let w approach zero. 

To derive a convenient version of Lemma 1 for the applications we have in mind, 
we set w = 1 and impose further restrictions on f. 


LEMMA 2. Let f be a function with m continuous derivatives, defined on t > x. 
Suppose that f(t) > 0 ast > © for allk = 0,1,...,m. Then forO<h<1 


co m-1 
E (76+ hv) = SHY pngy er, 
v=0 k=0 . 
where 
1 p00 ECA - t) 
R,,= ai ss ae (x + t)dt. 
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3. The Remainder for Gregory’s Series. The Euler numbers £, may also be 
defined by the generating function 


(3.1) 


Comparing (3.1) with (2.1), we see that 
1 
E, = 26,(5}. (3.2) 


The phenomenon mentioned in the introduction is entirely explained by the next 
proposition—if we set n = 500,000. It is also clear that we will get similar patterns 
for n = 10”/2 with any positive integer m. 


PROPOSITION 1. For positive integers n and M we have 


= 


ao 
4d ok =(-1 y » oe oat +R i(M), (3.3) 
where 
2| Eau 
|Ri(M) |< Ont 
Proof. Apply Lemma 2 with f(x) = 1/x; then set x = n and h = 1/2. We get 
20 =1)’ m~1 F (1/2) (—1)*k! 
$_D tet2) (avian a 
peottut 1/2 ,2, 2k! n 
with 
1 ,o E,.(h—t) (-1)"m! 
igo Gt 
249 (m-1)! (x+1)” 


We multiply both sides of (3.4) by 2(—1)”. Then the left-hand side is seen to be 
identical with the left-hand side of (3.3). After replacing m by 24 + 1 and taking 
into account (3.2) and the fact that odd-index Euler numbers vanish, we see that the 
first terms on the right-hand sides of (3.3) and (3.4) agree. To estimate the error 
term, we use the following inequality, 


[Exu(x)| < 2°] Eyy| for0<x<1 


(see, e.g., [1, p. 805]). Carrying out the integration now leads to the error estimate 
given in Proposition 1. Oo 


4. An Analogue For log 2. Lemma 2 can also be used to derive a result similar to 
Proposition 1, concerning truncations of the series 


log2 = y es (4.1) 


In this case the tangent numbers T, will play the role of the E,, in Proposition 1. It 
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follows from the identity 


1 Qeriz 5 
t =-|—--— - 
me lee 


together with (1.2) and (2.1) that 
T, 


n 


(-1)"2"E, (1) (4.2) 


as in (9, p. 28]. The 7, can be computed using the recurrence relation T) = 1 and 
y eee +T,=0 forn> 1. 
k=0 


Other properties can be found, e.g., in [8] or [9, Ch. 2]. 


PROPOSITION 2. For positive integers n and M we have 


— (-1)**? u M Tyee) 
yy —s= ys +> + R,(M), (4.3) 
kentl k 2n- yal (2n)** ; 
where 
| E> | 


\Ro(M)| < Gamat 


Proof. We proceed as in the proof of Proposition 1. Here we take x =n and 
h = 1. Using (4.2) and the fact that T)=1 and T,, = 0 for k > 1, we get the 
summation on the right-hand side of (4.3). The remainder term is estimated as in the 
proof of Proposition 1. o 

Using Proposition 2 with n = 10”/2 one again gets many more correct digits of 
log 2 than is suggested by the error term of the Taylor series. 


5. Generalizations. Proposition 1 and 2 can be extended easily in two different 
directions. 
i). The well-known infinite series (see, e.g., [1, p. 807}) 


as (-1)* = | E>, | n+1 = 
Py Gey = Pom (n=0,1,...), (5.1) 
and 
FL a 2-00) 
k=1 
= (22-1 Date (Gein) (5.2) 


can be considered as extensions of Gregory’s series and of (4.1). These series admit 
exact analogues to Propositions 1 and 2; one only has to replace f(x) = 1/x by 
f(x) = x7@"*), respectively x7”, in the proofs. 
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We note that the Euler-MacLaurin summation formula leads to similar results for 


00 | B. [Qen* 

Y ko = Sg, (5.3) 
cay (2n)! 

where multiples of the Bernoulli numbers B,, take the place of the E, and J, in 
Propositions 1 and 2. 


ii). A generalization of the Euler-MacLaurin and Boole summation formulas was 
derived by Berndt [3]. This can be applied to character analogues of the series 
(5.1)-(5.3). The roles of the E,, and T, in Proposition 1 and 2 are then played by 
generalized Bernoulli numbers or by related numbers. 


6. Additional Comments. The phenomenon observed in the introduction results 
from taking N to be a power of ten; taking N = 2-10” also leads to “clean” 
expressions. References [1], [5], [6], and [9] include the basic material on Bernoulli 
and Euler numbers, while [8] deals extensively with their calculation, and [2] 
describes an entertaining analogue of Pascal’s triangle. Much on the calculation of 
pi and related matters may be found in [4]. Euler summation is treated in [5], [6], 
and [9], while Boole summation is treated in [9]. Related material on the computa- 
tion and acceleration of alternating series is given in [7]. 


Added in Proof. A version of the phenomeon was observed by M. R. Powell 
and various explanations were offered (see The Mathematical Gazette, 66 (1982) 
220-221, and 67(1983) 171-188). 


REFERENCES 


1. M. Abramowitz and I. Stegun, Handbook of Mathematical Functions, Dover, N.Y., 1964. 
2. M.D. Atkinson, How to compute the series expansions of sec x and tan x, Amer. Math. Monthly, 
93 (1986) 387-388. 
3. B.C. Berndt, Character analogues of the Poisson and Euler-MacLaurin summation formulas with 
applications, J. Number Theory, 7 (1975 413-445. 
4. J. M. Borwein and P. B. Borwein, Pi and the AGM - A Study in Analytic Number Theory and 
Computational Complexity, Wiley, N.Y., 1987. 
§. T. J. l’a Bromwich, An Introduction to the Theory of Infinite Series, 2nd ed., MacMillan, London, 
1926. 
6. R.L. Graham, D. E. Knoth, and O. Patashnik, Concrete Mathematics, Addison Wesley, Reading, 
Mass., 1989. 
7, R. Johnsonbaugh, Summing an alternating series, this MONTHLY, 86 (1979) 637-648. 
8. D.E Kuuth and T. J. Buckholtz, Computation of Tangent, Euler, and Bernoulli numbers, Math. 
Comput., 21 (1967) 663-688. 
9. N. Nérlund, Vorlesungen tiber Differenzenrechnung, Springer-Verlag, Berlin, 1924, 
10. R. D. North, personal communications, 1988. 
11. N.J.A. Sloane, A Handbook of Integer Sequences, Academic Press, New York, 1973. 


13. A spigot algorithm for the digits of 7 (1995) 


Paper 13: Stanley Rabinowitz and Stan Wagon, “A spigot algorithm for the 
digits of 7,” American Mathematical Monthly, vol. 102 (March 1995), p. 195-203. 
Copyright 1995 Mathematical Association of America. All Rights Reserved. 


Synopsis: 

In this paper, Rabinowitz and Wagon introduce a very interesting “spigot al- 
gorithm” for the digits of 7. In particular, the algorithm, which acts on a two- 
dimensional table of data, generates the decimal digits of 7 one by one. The 
algorithm acts only on modest-sized integer data — no floating-point arithmetic is 
involved. The process was discovered by some experimentation using the Mathe- 
matica software. 

This scheme is different in nature from the BBP algorithm, to be presented 
in the next chapter, in that it cannot directly generate digits of 7 at an arbitrary 
starting point. But it is very simple, and, unlike the BBP formula and algorithm, 
it works in base ten. 


Keywords: Algorithms 
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A Spigot Algorithm for the Digits of tr 


Stanley Rabinowitz and Stan Wagon 


It is remarkable that the algorithm illustrated in Table 1, which uses no floating- 
point arithmetic, produces the digits of a. The algorithm starts with some 2s, in 
columns headed by the fractions shown. Each entry is multiplied by 10. Then, 
starting from the right, the entries are reduced modulo den, where the head of the 
column is num /den, producing a quotient q and remainder r. The remainder is 
left in place and q X num is carried one column left. This reduce-and-carry is 
continued all the way left. The tens digit of the leftmost result is the next digit of 
ar. The process continues with the multiplication of the remainders by 10, the 
reductions modulo the denominators, and the augmented carrying. 


Tapes 1. The workings of an algorithm that produces digits of 7. The dashed line indicates the key 
step: starting from the right, entries are reduced modulo the denominator of the column head 
(25, 23, 21,...,resp.), with the quotients, after multiplication by the numerator (12, 11, 10,...)}, carried 
left. For example, the 20 in the a's column yields a remainder of 1 and a left carry of 1 - 9 = 9. After 
the leftmost carries, the tens digits are 3, 1,4,1. To get more digits of 7 one must start with a longer 
string of 2s. 


§ 7 8 9 

13 15 17 19 

Initialize 5 2.9 8 So 
«10 20 20 20 20 20 720 20 20 20 20 20 20 20 
Carry 3 «a fey a2, Leg +0, da ety Eos £25 gay Oy 20-, — 
330 1.32, |'92,,\32, | 30, \'32. V22, 1\28..|29. | 20, | 20, 20, h30 , 

Remainders 0 224 “goad Seed MP dg ya) Ny) Sag) fog) “ag 
«10 20 20 40 30 100 10 130 120 10 200 200 200 
Carry +20 +33 +40 +65 +48 +98 +88 +72 +150 +132 ($96; _— 
40 53 80 95 148 108 218 192 160 332 296%200.. 

Remainders 1 3 3 5 5 4 8 5 8 17 20 “Oe” 
«10 10 30 30 50 50 40 80 50 80 170 200 
Carry +63 +64 +90+120 +88 +0 
34 60 70 90 92 103 144 140 200 258 200 

Remainders 1 0 0 0 4 12 9 4 10 6 16 
His 10 0 0 0 40 120 90 40 100 60 160 
Carry In id +2 +9 +24 +55 +84 +63 +48 +72 +60 +66 +0 
14 12 9 24 55 124 183 138 112 160 126 160 


o 


oll cool 


This algorithm is a “spigot” algorithm: it pumps out digits one at a time and 
does not use the digits after they are computed. Moreover, the digits are generated 
without any use of high-precision (or low-precision) operations on floating-point 
real numbers; the entire algorithm uses only ordinary integer arithmetic on 
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relatively small integers. For example, to obtain the first 5,000 digits of a requires 
only arithmetic operations on integers less than 600,000,000. Although high- 
precision floating-point routines are built up from integer operations, the algo- 
rithms in this paper are quite simple and do not simulate floating-point computa- 
tions. 

In order to motivate the v-algorithm, we first discuss the much simpler case of 
e, for which a spigot algorithm was discovered by Sale [Sale]. His algorithm is the 
basis of the discussion in §1. 


1. A NUMBER SYSTEM IN WHICH e’s DIGITS ARE PERIODIC, A real num- 
ber’s decimal representation may be interpreted as an infinitely nested expression; 
for example: 


v2 = 141421356... = 1+ (4+ w(E + (4 + (2 + HU+ ---))))). 


Some interesting and useful representations may be obtained if we chartge the 
base-sequence, which in the case above is (34, +4, 4. 7; ..-). For example, using the 
base b = (3, 4, +, 4, ...) yields the following form, called a mixed-radix represen- 


tation (see [Knu, §4.1}): 
a&y + 3( a + 3( 42 “E i(43 r $(44 + §(45 + ---))))), 


where the a; (the digits) are nonnegative integers. If 0 <a, <i for i= 1, the 
representation is called regular. Mixed-radix representations will be denoted by 
(9; @1, Ap, 43, 44,...),. For base b, every positive real number has a regular 
representation and representations are unique provided we exclude representa- 
tions that terminate with maximal digits (otherwise, for example, += 
(G; 1, 0, 0,...), = (0; 0, 2,3, 4,5, 6,...),,); from now on and for ail bases, we exclude 
such representations. The proof of the following Lemma is in Appendix 1. 


Lemma i(a). If i > 1,(0;0,0,...,0, 4; 4;41,---), < 45 im particular, 
(0; @), 2, @3,44,-.-), < 1. 

(b). Representations using the mixed-radix base » are unique. 

(c). The integer part of (a3 4,, 42, 43, 44,--.), & Gq and the fractional part is 
(0; a4, Ao, @3, 4, -.p- 


In this number system some irrationals become periodic. For example, e = 
(2;1,1,1,1,...),3 this is just a restatement of the infinite series LA as 
1+ 40+ 404+ 40+ 41 + E11 +...)))). Rational numbers in this system cor- 
respond to digit-sequences that terminate (Appendix 1, Lemma 2). 

The decimal digits of a real number x in [0,10) can be obtained by taking the 
integer part of x, multiplying its fractional part by 10, taking the integer part of the 
result, multiply the resulting fractional part by 10, and so on. In some mixed-radix 
bases, this is especially simple. If x = (ap; @,,a2,...,4,),, then 10x = 
(1049; 10a,, 10a,,10a@3,...,10a,),. The latter may not be a regular expression: 
some digits may be too big, But we can decrease digits by reducing them modulo 7, 
where 7 is the denominator of the corresponding element of b. Starting these 
reductions at the right end, we carry the quotients left, eventually getting the 
regular representation of 10x. Thus multiplying by 10 is algorithmically straightfor- 
ward. Taking the integer and fractional parts for b-representations is also easy, 
thanks to Lemma 1(c). 
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We can now give the algorithm to get the first » base-10 digits of e. A proof of 
correctness—the error analysis showing that n + 2 mixed-radix digits suffice' to 
get n base-10 digits—is given as Lemma 3 in Appendix 1. 


Algorithm e-spigot 


1. Initialize: Let the first digit be 2 and initialize an array A of length n + 1 to 
(1, 1,1,..., 1). 
2, Repeat n — 1 times: 
Multiply by 10: Multiply each entry of A by 10. 
Take the fractional part: Starting from the right, reduce the ith entry of A 
modulo i + 1, carrying the quotient one place left. 
Output the next digit: The final quotient is the next digit of e. 


The first few steps of this algorithm, starting with an array of 10 1s (this 
corresponds to 11 mixed-radix digits, good for 9 digits of e; only 5 are shown), are 
displayed in Table 2. 


Taste 2. The workings of a spigot algorithm for the digits of e (in bold). The reductions in the 
column headed + are performed modulo i. The leftmost base-10 real numbers are the values of the 
rows viewed as mixed-radix representations. Since only 11 mixed-radix digits start the algorithm, the 
first base-10 number is only an approximation to e. 


2.718281826... 
7,18281826... 
carries 


= 


IN OW 


0.18281826... 
1.8281826... 
carries 


3 
#2 
32 


0.8281826... 
8.281826... 
carries 


0.281826... 
2.81826... 
carries 


mukhooakSrubbookbhsy 


wih Bu tbSuubooSbs. 
woboo SKS8uuboosbs- 


w 
On 


0.81826... 


2. ASPIGOT FOR DIGITS OF Tr. The ideas of §1 lead to a spigot algorithm for 
ar, but there are additional complexities and additional interesting questions that 
distinguish 7 from e. Our starting point is the following moderately well-known 


‘Any digit-producing algorithm for a presumed-normal number x suffers from a drawback that, 
although unlikely, can impinge on the result. If x is between 1 and 10 and the algorithm says that the 
first 100 digits of x are, say, 4,6,5,0,7,...,3,9,9,9, 9,9 then one cannot be sure that the last 6 digits 
are correct. They will be the digits of a certain approximation to x that is within 5 - 10~" of the true 
value, One cannot simply go farther until a non-9 is reached, because memory allocations must be made 
in advance. The user must realize that a terminating string of 9s is a red flag concerning those digits 
and even with no 9s, the last digit might be incorrect. In practice, one might ask for, say, 6 extra digits, 
reducing the odds of this problem to one in a million. 
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series: 
«© (il)? Qi+l 
T= Soe 
Zp (2i+ 1)! 
This series can be derived from the Wallis product for a; another approach uses 
an acceleration technique called Euler’s transform applied to the series 7 = 4 — 
442 -—44.,.. These proofs, together with three others and references to 


earlier sources, may be found in [Li]. We let &!! denote the product 1:3-5--- k 
for odd integers k; then the series is equivalent to 


7 ” i! 1 Ted P2333 
2? Ore os. aoe aise 
which expands to become 
T 1 2 3 4 
5 =1+ (1+ (1+ 3(1+5a+-->}]). 


This last expression leads to the mixed-radix base c = (4, ?, 3, $,...), with respect 


to which a is simply (2;2,2,2,2,2,...),. For a regular representation in base c, 
the digit in the ith place must lie in the interval [0, 2i]. Unfortunately, base c is less 
accommodating than b. 


Lemma 4 (Proof in Appendix 1). The base-c number with maximal digits, 
(0; 2, 4, 6,8,...), represents 2; hence regular representations of the form 
(0; a, b,c,...), lie between 0 and 2. 


Lemma 4 implies that ec-representations are not unique. For example, 
(0; 0, 4,6,8,...), = 2 -— $= $, whence (0;0,2,3,4,...), = 2 = (0; 2,0,0,0,...). 
More relevant algorithmically, integer and fractional parts using ¢ are not straight- 
forward, as they are for b. The integer part of (a9; @,,4,...), is either ag or 
a, + 1 according as (0; a1, 42,...) is in [0, 1) or [1,2). This problem is surmounted 
by leaving the units digit of a) in place during the next iteration and calling the 
tens digit of a, a predigit. The predigits must be temporarily held because 
occasionally (once every 20 iterations, roughly) the next predigit is a 10; this will 
happen when the carry, which is between 0 and 19, is greater than 10 and, 
simultaneously, the leftover units digit of a, is 9, which becomes 90 in the 
multiply-by-10 step. This event requires that the held number be increased by 1 
before being released. Specific details of the algorithm follow; the presentation at 
the beginning of this paper sidestepped the problem of the occasional 10. The 
proof that [1072/3] mixed-radix digits suffice for n digits of 7 is in Appendix 1 
(Lemma 5). Appendix 2 contains a Pascal implementation of this algorithm. 


Algorithm 7r-spigot 


1. Initialize: Let A = (2,2,2,2,...,2) be an array of length [107/31]. 
2. Repeat n times: 

Multiply by 10: Multiply each entry of A by 10. 

Put A into regular form: Starting from the right, reduce the ith element of A 
(corresponding to e-entry (i — 1)/(2i ~ 1)) modulo 2i — 1, to get a 
quotient q and a remainder r. Leave r in place and carry g(i — 1) one 
place left. The last integer carried (from the position where i — 1 = 2) 
may be as large as 19. 
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Get the next predigit: Reduce the leftmost entry of A (which is at most 
109[= 9 - 10 + 19}) modulo 10. The quotient, g, is the new predigit of 
ar, the remainder staying in place. 

Adjust the predigits: If q is neither 9 nor 10, release the held predigits as true 
digits of + and hold gq. If q is 9, add g to the queue of held predigits. If 
q is 10 then: 

« set the current predigit to 0 and hold it; 
« increase all other held predigits by 1 (9 becomes 0); 
« release as true digits of 7 ali but the current held predigit. 


This algorithm uses only integer arithmetic and is easy to program. The table at 
the beginning of the paper shows it in action, starting with 13 mixed-radix digits of 
ar (good for 4 base-10 digits). To clarify the working of the algorithm, note that the 
(finite) first row of Table 1 is a mixed-radix representation of 3.1414796..., 
the second row represents 31.414796..., the fifth row represents 1.414796..., the 
sixth row is 14.14796..., the ninth row is 4.14796..., and so on. Table 3 shows 
the result of a computation using a larger initial array; the holding aspect does not 
become relevant until the 32nd digit. 


Tasce 3. The actual digits of az (bottom) compared to the sequence of leftmost base-c digits for 35 
iterations with a starting array of 116 2s (good for 35 digits). At the 32nd iteration a 102 shows up, 
yielding a predigit of 10. 


30 13 41 15 58 92 26 64 53 35 58 89 97 78 92 32 23 38 84 45 62 26 63 42 33 38 82 32 27 78 94 49 102 28 87 


31415926535897932384626433832795 028 


We repeat that the algorithm uses only integer operations. To get 5,000 digits of 
ar requires only integer arithmetic on numbers less than 600,000,000. The aigo- 
rithm leads naturally to the question of improving it to one that is essentially as 
simple as e-spigot. 


Question. 1s there a base d of rationals such that 7 has a d-representation that is 
periodic, or an arithmetic progression, and such that a, is always the integer part 
of (ag3 44, 42,.--)g? 


Gosper [Gos, p. 32] has discovered a series for 7 that brings us tantalizingly 
close to spigot-perfection: 


1 a-3 i ose. a5 

T= 34 Oot OTe 35t M78 3101 3 
i oe 4:5 4°7 
60 7-8-3 10'11-3 13-143 


8+ 


34+ 


He obtained this series by using a refinement of the Euler transform on 4 — ; + 
4 —44.... Gosper’s series leads to the base d = (4, is, 330- Hg»---), With 
respect to which wv is (3; 8, 13,18,...). A computation shows that 
(0; 59, 167, 329, 545,...)4 = 1.092..., a substantial improvement over the 2 that 


arose for c. Under the usual randomness assumption for 7’s digits, the odds of a 
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bad predigit in base c are 1 in 20, while in base d they decrease to less than 1 in 
110; this is because a d-predigit of 10 occurs only when the remainder is a 9 (which 
becomes 90) and the carry is a 10. The former happens 10% of the time, while the 
latter happens no more than once in 11 iterations because the carry is the integer 
part of a real between 0 and 10.93. So base d is within 1% of spigot-perfection. 
Because Gosper’s series converges more quickly than the one we used, it has less 
memory requirements: n digits of + require an initial array of length n; however, 
the arithmetic on the array will involve integers larger than those in an array of the 
same size using base c. 

One way to improve the Gosper-series approach is to reduce the fractions in d 
to lowest terms. Then the regular number with maximal digits is (0; 59, 27, 21, 
38,...)y, which equals 1.0000476468.... It is not hard to see that the regular 
representation of 7 is unchanged in this new base. However, the work expended 
in reducing to lowest terms outweighs the gain made in reducing the number of 
times a 10 appears as a predigit. Thus it is likely that an affirmative answer to the 
question above is of more theoretical than practical interest. 

The spigot algorithm for m is by no means competitive with the recently 
discovered fast algorithms (due to the Borwein brothers, the Chudnovsky brothers, 
and others) that have been used to compute hundreds of millions of digits of 7 
(see [BBB]). But the spigot algorithm does have the advantage of avoiding all 
floating-point computations; thus it is easily implemented on a home computer 
where it can produce thousands of digits in a few minutes. Moreover, it gives the 
result directly in base 10 (most other z-algorithms produce the result in binary or 
some internal format and a second pass must be made to obtain decimal digits). 

The algorithm given here can be made to run faster by outputting multiple 
digits at a time. For example, to get five decimal digits at a time, simply compute 
the digits of a using base 100,000. This can be done by multiplying by 100,000 
instead of 10 in the main step. The integer part is then the next “digit” in base 
100,000. 

If one is working in base 100,000 and knows in advance that the portion of digits 
to be computed does not contain the string 00000, then one can omit the lengthy 
part of the algorithm that adjusts the predigits. This can lead to an exceedingly 
short computer program. For example, Rabinowitz [Rab] used this idea to exhibit a 
14-line Fortran program that outputs 1,000 decimal digits of 7. 

Finally, we mention that the algorithm can be parallelized, in which case it 
becomes blindingly fast up to about 10,000 digits. 

For examples of spigot algorithms for other functions, see [Abd]. 


APPENDIX I. FIVE LEMMAS 
Lemma I(a). Jf i > 1,(0;0,0,...,0, @;,4;41,--.p» < 73 in particular, (0; a,, ay, a3, 
Ay3000 gS 1s 

(b). Representations using mixed-radix base b are unique. 

(c). The integer part of (a9; 41,42, 43, 44,...)y iS Gq and the fractional part is 
(0; @;, @2, 43, A4,-- py. 


Proof: (a). It suffices to prove that Lz_;,(& - D/k!= 1/i!, which follows from 
the fact that*the series telescopes to: 
1 1 


+l G42)” (+3)! 


1 1 1 1 
[= - G+)! Gea ~ (+2)! 
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(b). Suppose (a3 4), @2, 43, @4,-..), ANd (Co; C1, Co, C3, C4,--- Jy Tepresent the 
same real number. Then, for some i, 0 = Ly_jd,/k!, where |d,| < & and d, # 0. 
But then |d;|/i!< LR_;.,ld,|/k!, contradicting (a). 

(c). This follows from (a). 


Lemma 2. A positive number is rational iff its digits using the mixed-radix base b are 
eventually 0, 


Proof: The reverse direction is obvious. For the forward direction we use a 
sublemma. 


Sublemma. For any integers t and n, with 0 <n <t}, there are integers d, in [0, i] 
such that n= d,t(t ~ DG — 2)-+- 4-3 + dott — DU — 2)-+-5- 44 
vos +d,_,tHt - + d,_.t + d,_4. 


Proof: By induction on ¢. If n < t! write n as qt +r withO<r<tand0<q< 
(t — 1)!. By induction there is a sequence (d,, d,...d,_3, d,_») that is a solution 
for q with respect to terms (¢ — 1Xt — 2)---4:3, and the like, whence 
(d,, d,,...d,_3,d,_>, r) is a solution for n w.r.t. the terms t(t — D(t — 2)--- 4+ 3, 
and the like. 


Returning to Lemma 2’s proof, suppose a positive rational s/t is given. Use the 
sublemma to express s(t— 1)! in the form d,t(t — 1Xt— 2)---4-3 + dt 
(f-— IDG -—2)---5-44-+-- +d,_4t¢ — D+ d,_,t + d,_,. Dividing by t! then 
yields a representation of s/t as a sum of reciprocals of factorials with appropri- 
ately small coefficients, which is the same as a terminating representation in the 
mixed-radix base b. 


Lemma 3. The algorithm for digits of e is correct. 


Proof: It must be shown that n + 2 mixed-radix digits of e suffice to get n base-10 
digits of e. We first prove that if n > 28 (=[10e]), then n mixed-radix digits 
suffice for n base-10 digits. Using n mixed-radix digits means we are actually 
getting the base-10 digits of e, = (2;1,1,1,...,1) = L7_91/i!. Thus we must show 
that 

e—e, <5-10~" (see footnote at beginning of paper). A geometric series estima- 
tion of the tail of the series shows that e — e, < 2/(m + 1)!, and then Stirling’s 


formula yields 
2 1 e\" 1 ) 
—————_ ¢ — <|/-} </—]. 
(n+ 1)! ar [=] (= 


If n < 28 then a direct computation of the digits shows that n + 2 mixed-radix 
digits suffice. 


Lemma 4. The base-c number with maximal digits, (0;2, 4, 6,8,...), represents 2; 
hence regular representations of the form (0; a, b,c,...), lie between 0 and 2. 


Proof: Instead of giving a formal proof, we show how some Mathematica computa- 
tions led to the result (and a proof). In terms of series, the lemma states that 

bd 2i)i} 

= Git _, 


i (i +,1)! 
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A rough calculation showed that the sum is near 2. Then a rational computation of 
the remainders—the differences between the partial sums and 2—-yielded the 
following sequence. 


4 4 16 16 32 32 256 (256 512 _512 
The pattern in these remainders was found by dividing each by the preceding one, 
which yielded: 


5> 79 99 119 139 157 179 199 21° 


Induction proves the pattern to be valid in general; it follows that the remainders 
have the closed form 2"*!/ (2 * ) which converges to 0, as claimed. 


Lemma 5. The algorithm for digits of m is correct. 


Proof: As for e, we look at 7 — 7,,, where 7,, = (2;2,2,...,2),. This error is the 
tail of our main series for 7: L7_,,(é!)’2'*'/(Qi + W!. This tail is less than twice its 
first term since each subsequent term is less than half its predecessor, leading us to 
study m!?2™+?/(2m + 1)!. Splitting the denominator into evens and odds turns 
this into: m!2?/(3 -5+++(2m + 1)), which is less than 3m! 2?7/(2 - 4---+ (2m)), or 
1/3 - 2-1). It is easy to see (using the fact that 3 < log, 2) that this last is less 
than 5-10~” when m = |107/3], as claimed. 


APPENDIX 2. PASCAL CODE 

The following program, for which we are grateful to Macalester student Simeon 
Simeonoy, implements the algorithm 7-spigot. This code makes use of the fact that 
the queue of predigits always has a pile of 9s to the right of its leftmost member, 
and so only this leftmost predigit and the number of 9s need be remembered. The 
program computes 1000 digits of 7 and requires a version of Pascal with a longint 
data type (32-bit integer). 


Program Pi_Spigot; 


const n = 1000; 
len =10*n div 3; 
var i, j, k, q, x, nines, predigit : integer; 
a : arrayf1..len] of lLongint; 
begin 
for j == 1 to len do alj] := 2; {Start with 2s} 
nines := 0; predigit := 0 {First predigit is a 0} 
for j := 1 to n do 
begin q := Q; 
for i := Len downto 1 do {Work backwards} 
begin 
x = 10*alLil + q*i; 
ali] := x mod ¢€2*i-1); 
q := x div (2*%i-1); 
end; 
al1J := q mod 10; q := q div 10; 
if q = 9 then nines := nines + 1 


else if q=10 then 
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begin writeC€predigit +1); 


for k := 1 to nines do write(0); {zeros} 
predigit := 0; nines := 0 
end 


else begin 
writeCpredigit); predigit := q; 
if nines <> 0 then 


begin 
for k := 1 to nines do write(9); 
nines := 0 
end 
end 
end; 
writeln(predigit); 
end. 


ADDED IN Proor. The latest version of Mathematica (2.3) can sum many of the series that occur in 
this paper. It takes only a second or so to get 7/2 as the sum of the crucial series at the beginning of 
section 2, to get 1/i! for the series in Lemma 1’s proof, and to get 2 as the sum of the series in Lemma 
4’s proof. 
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Paper 14: David H. Bailey, Peter B. Borwein and Simon Plouffe, “On the rapid 
computation of various polylogarithmic constants,” Mathematics of Computation, 
vol. 66 (1997), p. 903-913. Reprinted by permission of the American Mathematical 
Society. 


Synopsis: 

For many years, it had been widely presumed impossible to calculate, say, the 
millionth binary digit of 7 any faster than simply computing all one million digits. 
Thus it was with considerable interest that in 1996 the above authors announced 
that they could do just that. The authors presented a surprisingly simple algorithm, 


based on the formula 


1 4 2 1 1 
ios a k ( ) , 
aa 8k+1 8k+4 8k+5 8k+6 
that permits one to directly calculate a string of binary or hexadecimal digits of 
am, beginning at position n, without needing to compute any of the first n — 1 
digits. The authors further presented formulas of this type for numerous other 
mathematical constants. 

However, the real story of interest here is that the above formula (now known 
as the BBP formula) was discovered by a computer program, running Helaman 
Ferguson’s PSLQ integer relation algorithm. This was the first notable discovery 
by the PSLQ algorithm; since 1997, a large number of other previously unknown 
mathematical identities have been found in this process. Indeed, PSLQ is now a 
premier tool of the rapidly expanding field of experimental mathematics. 
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ON THE RAPID COMPUTATION OF VARIOUS 
POLYLOGARITHMIC CONSTANTS 


DAVID BAILEY, PETER BORWEIN, AND SIMON PLOUFFE 


ABSTRACT. We give algorithms for the computation of the d-th digit of cer- 
tain transcendental numbers in various bases. These algorithms can be easily 
implemented (multiple precision arithmetic is not needed), require virtually 
no memory, and feature run times that scale nearly linearly with the order of 
the digit desired. They make it feasible to compute, for example, the billionth 
binary digit of log (2) or z on a modest work station in a few hours run time. 

We demonstrate this technique by computing the ten billionth hexadecimal 
digit of 1, the billionth hexadecimal digits of x7, log(2) and log?(2), and the 
ten billionth decimal digit of log(9/10). 

These calculations rest on the observation that very special types of iden- 
tities exist for certain numbers like 1, 77, log(2) and log?(2). These are essen- 
tially polylogarithmic ladders in an integer base. A number of these identities 
that we derive in this work appear to be new, for example the critical identity 
for 7m: 


come 4 2 1 1 
BD eng Ga a ar ee) 
i=0 ‘ 


1. INTRODUCTION 


It is widely believed that computing just the d-th digit of a number like 7 is 
really no easier than computing all of the first d digits. From a bit complexity 
point of view this may well be true, although it is probably very hard to prove. 
What we will show is that it is possible to compute just the d-th digit of many 
transcendentals in (essentially) linear time and logarithmic space. So while this is 
not of fundamentally lower complexity than the best known algorithms (for say 7 
or log(2)), this makes such calculations feasible on modest workstations without 
needing to implement arbitrary precision arithmetic. 

We illustrate this by computing the ten billionth hexadecimal digit of 7, the 
billionth hexadecimal digits of 7, log(2) and log’(2), and the ten billionth decimal 
digit of log(9/10). Details are given in Section 4. A previous result in this same 
spirit is the Rabinowitz-Wagon “spigot” algorithm for 7. In that scheme, however, 
the computation of the digit at position mn depends on ail digits preceding position 
n. 

We are interested in computing in polynomially logarithmic space and polyno- 
mial time. This class is usually denoted SC (space = log?) (d) and time = d° 
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where d is the place of the “digit” to be computed). Actually we are most interested 
in the space we will denote by SC* of polynomially logarithmic space and (almost) 
linear time (here we want the time = O(dlog?‘(d))). There is always a possible 
ambiguity when computing a digit string base b in distinguishing a sequence of 
digits a{b — 1)(b — 1)(b — 1) from (a + 1)000. In this particular case we consider 
either representation as an acceptable computation. In practice this problem does 
not arise. 

It is not known whether division is possible in SC, similarly it is not known 
whether base change is possible in SC. The situation is even worse in SC*, where it 
is not even known whether multiplication is possible. If two numbers are in SC* (in 
the same base) then their product computes in time = O(d? log?‘ (d)) and is in 
SC but not obviously in SC*. The d? factor here is present because the logarithmic 
space requirement precludes the usage of advanced multiplication techniques, such 
as those based on FFTs. 

We will not dwell on complexity issues except to point out that different algo- 
rithms are needed for different bases (at least given our current ignorance about 
base change) and very little closure exists on the class of numbers with d-th digit 
computable in SC. Various of the complexity related issues are discussed in [6], {8}, 
9}, (14), (14). 

As we will show in Section 3, the class of numbers we can compute in SC* in 
base 6 includes all numbers of the form 


p(k) 
ae < bekg(k)’ 


where p and q are polynomials with integer coefficients and c is a positive integer. 
Since addition is possible in SC*, integer linear combinations of such numbers are 
also feasible (provided the base is fixed). 

The algorithm for the binary digits of 7, which also shows that 7 is in SC* in 
base 2, rests on the following remarkable identity: 


Theorem 1. The following identity holds: 


oo 


(1.2) 


3. Ha 2 1 1 
CDi ee 


This can also be written as: 


(1.3) t= La , [pa] = (4, 0, 0, —2, —1, -1, 0,0] 
<7 1618; 


where the overbar notation indicates that the sequence is periodic. 


Proof. This identity is equivalent to: 


1/v2 Mer Gh 4 _ 9.5 
(1.4) «=| Ay/2 — 8x3 — 4/22 8a diss 
0 


1-28 


which on substituting y := 2x becomes 


1 
16y — 16 
he ee 
= [ ye —2yt4y—4 


14. RAPID COMPUTATION OF POLYLOGARITHMIC CONSTANTS (1997) 223 


THE RAPID COMPUTATION OF VARIOUS POLYLOGARITHMIC CONSTANTS 905 


The equivalence of (1.2) and (1.4) is straightforward. It follows from the identity 


Ege /v2 © k-148i 
au | grit’ dx 
[ Lg Jo » 


iz) 


1 1 
= & 2 i{Q3 . 
fe a 16 (8i + k) 


That the integral (1.4) evaluates to 7 is an exercise in partial fractions most easily 
done in Maple or Mathematica. Oo 


This proof entirely conceals the route to discovery. We found the identity (1.2) 
by a combination of inspired guessing and extensive searching using the PSLQ 
integer relation algorithm [3],{12}. 

Shortly after the authors originally announced the result (1.2), several colleagues, 
including Helaman Ferguson, Tom Hales, Victor Adamchik, Stan Wagon, Donald 
Knuth and Robert Harley, pointed out to us other formulas for 7 of this type. One 
intriguing example is 


fa et M2 ES 
~ ep 16 B+1° 4642 ° 444+3 4645 4646 447" 


which can be written more compactly as 


roa sk 
ete @ 2 1 
= a ad Bo eS 


i=0 
In [2], this and some related identities are derived using Mathematica. 

As it turns out, these other formulas for 7 can all be written as formula (1.2) 
plus a rational multiple of the identity 


i252 Bog ee a Sg eg 1) 
~ 4 16 BE +1 Bi +2 © B43 © BI+4 5 BI+5  B+6 B47" 


i=0 
The proof of this identity is similar to that of Theorem 1. 

The identities of the next section and Section 5 show that, in base 2, 7”, log”(2) 
and various other constants, including {log(2), log(3),... , log(22)} are in SC*. (We 
don’t know however if log(23) is even in SC.) 

We will describe the algorithm in Section 3. Complexity issues are discussed in 
[3], [5], [6}, [7], [8], [9], [14], [19], [21] and algorithmic issues in [5], [6], [7], {8], [14]. 
The requisite special function theory may be found in {1], [5], {15}, [16], [17], {20}. 


2, IDENTITIES 


As usual, we define the m-th polylogarithm L,, by 
(2.1) Lm(z) := > am la<t. 


The most basic identity is 
(2.2) —log(1-— 27") = £,(1/2") 


which shows that log(1 — 27”) is in SC* base 2 for integer n. (See also section 5.) 
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Much Jess obvious are the identities 


(2.3) a? = 36L9(1/2) — 36L2(1/4) — 12L(1/8) + 6L2 (1/64) 
and 
(2.4) log?(2) = 4L2(1/2) — 6L2(1/4) — 2L2(1/8) + L2(1/64). 
These can be written as 
fone] 

2 ay \ 
(2.5) n= po sur fa,] = (| -3,-2, 3,0, 
2. 279) = 2 N° 4 ;] = [2, -10, —7, —10, 2, —1). 
(2.6) log” (2) D3 [bi] = 2,-10, =7, -10, 2, J] 


Here the overline notation indicates that the sequences repeat. Thus we see that 
x? and log?(2) are in SC* in base 2. These two formulas can alternately be written 


aly Sy 16 24 8 Got 
~ 84 64 (6141)? (Gi +2)? (Gi +3)? (Gi +4)? © (61+5)?” 


1s 1 /-16 16 40 14 
2 
2 = ar —_ = 
log’ (2) = 5 en ep (itl? Gita (i+3y 
10 rs 1 ) 

(6¢+4)?  (6i+5)?”" 
Identities (2.3)-(2-6) are examples of polylogarithmic Jadders in the base 1/2 in 
the sense of [16]. As with (1.2) we found them by searching for identities of this 
type using an integer relation algorithm. We have not found them directly in print. 
However (2.5) follows from equation (4.70) of [15] with a = 7/3,8 = 2/2 and y = 
1/3. Identity (2.6) now follows from the well known identity 


(2.7) 12L(1/2) = n* — 6log?(2). 
A distinct but similar formula that we have found for 7? is 
> 16 16 8 16 
2 = SRE cipereentieionasermeniaagin Scien : dapper emacs * Siguiente‘, Sami >. pcmatgeegmseemsan agian 
i d 16! (& +12 (+2)? (8% +3)? (%& +4)? 
4 4 2 


(8i+5)? (846)? + (81 + 72) 
which can be derived from the methods of section 1. 
There are several ladder identities involving Ls: 


(2.8) 35/2¢(3) — m* log(2) = 36L3(1/2) — 18L3(1/4) — 4£3(1/8) + L3(1/64) , 
(2.9)  2log*(2) — 7¢(3) = —24L3(1/2) + 1823(1/4) + 4L3(1/8) — L3(1/64), 
(2.10) 

10 log? (2) — 2m? log(2) = —48L3(1/2) + 54L3(1/4) + 12L3(1/8) — 3L3(1/64). 


The favored algorithms for 2 of the last centuries involved some variant of 
Machin’s 1706 formula: 


1 1 
(2.11) ‘ = 4arctan 5 arctan 330° 
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There are many related formulas {15}, [16], [17], [20] but to be useful to us all the 
arguments of the arctans have to be a power of a common base, and we have not 
discovered any such formula for 7 . One can however write 


1 1 
(2.12) . = 2 arctan —= + arctan —= 


v2 v8 


This can be written as 


(2.13) VOn = 4f(1/2) + f(1/8) where f(a) := y ve 


and allows for the calculation of /27 in SC*. 
Another two identities involving Catalan’s constant G, 7 and log(2) are: 


mlog(2) Ci TTT) 

: = i 4) = 1,1,1, »-i,-1,-1 
(2.14) CG 5 rears le] =[ 0 0] 
and 

5 2 log*(2) _ yn _ di pio oo 
(2.15) 96" = 8 = Dara (d;] = (1, es ad id id} , lj. 
i=1 


These may be found in [17, p. 105, p. 151]. Thus 8G — 7 log(2) is also in SC* in 
base 2, but it is open and interesting as to whether G is itself in SC* in base 2. 
A family of base 2 ladder identities exist: 
Lm(1/64) = Em(1/8) 2 Lm(1/4) 4 Lm(1/2) 5 (—log(2))” 


ar aa a a TT. 


2 (-log(2))"? _ rf (= log(2))""* _ 403 ¢(5) (= Jog(2))"™ 

54 (m — 2)! 486 (m — 4)! 1296 (m — 5)! 
The above identity holds for 1 < m < 5; when the arguments to factorials are 
negative they are taken to be infinite so the corresponding terms disappear. See 
[16, p. 45]. 

As in the case of formula (1.2) for 7, colleagues of the authors have subsequently 
pointed out several other formulas of this type for various constants. Three exam- 
ples reported by Knuth, which are based on formulas in [13, p. 17, 18, 22, 47, 139], 
are 


=0. 


oY , 1/2 1/4 1/8 
2 In(1 
V2 In(1 + V2) = = Deas Tera Be 347) 
1/2 1/4 1/8 
Qarctan(1 ~ 
V2 arctan(1/V2) = Sa sri S58 BES 847)” 


oo 


digs a 1 1/2 «1/4 
fen ase a Sis 
archenkl/3) Die saa aa N44 4s) 


Thus these constants are also in class SC*. Some other examples can be found 
in [18}. 
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3, THE ALGORITHM 


Our algorithm to compute individual base-6 digits of certain constants is based 
on the binary scheme for exponentiation, wherein one evaluates 2” rapidly by suc- 
cessive squaring and multiplication. This reduces the number of multiplications to 
less than 2log,(n). According to Knuth [14], where details are given, this trick 
goes back at least to 200 B.C. In our application, we need to perform exponentia- 
tion modulo a positive integer c, but the overall scheme is the sare — one merely 
performs all operations modulo ¢. An efficient formulation of this algorithm is as 
follows. 

To compute r = 6” mod c, first set £ to be the largest power of two < n, and set 
r=1. Then 

A:ifn>tthenr+brmodc; nen-—-t; endif 
t<- t/2 
iff>1thenr«r? mode; goto A; endif 

Here and in what follows, “mod” is used in the binary operator sense, namely 
as the binary function defined by x mod y := x — |[x/yly. Note that the above 
algorithm is entirely performed with positive integers that do not exceed c? in size. 
Thus it can be correctly performed, without round-off error, provided a numeric 
precision of at least 1 + 2 log, c bits is used. 

Consider now a constant defined by a series of the form 


aera 
ae > b&p(ky’ 


where 6 and ¢ are positive integers and p(k) is a polynomial with integer coefficients. 
First observe that the digits in the base b expansion of S beginning at position n+1 
can be obtained from the fractional part of b”S. Thus we can write 

& pn—ck 


b 
3.4 6° S mod1l= —— mod1 
oe 2 Oe) 


n/c} pn —ck oO —ck 
b”—°* mod p(k) bere 
= ———_.—— mod 1 + y 

k=0 p(k) 


For each term of the first summation, the binary exponentiation scheme is used 
to evaluate the numerator. Then floating-point arithmetic is used to perform the 
division and add the result to the sum mod 1. The second summation, where 
the exponent of 0 is negative, may be evaluated as written using floating-point 
arithmetic. It is only necessary to compute a few terms of this second summation, 
just enough to insure that the remaining terms sum to less than the “epsilon” of 
the floating-point arithmetic being used. The final result, a fraction between 0 and 
1, is then converted to the desired base 0. 

Since floating-point arithmetic is used here in divisions and in addition modulo 
1, the result is of course subject to round-off error. If the floating-point arithmetic 
system being used has the property that the result of each individual floating-point 
operation is in error by at most one bit (as in systems implementing the IEEE 
arithmetic standard), then no more than log,(2n) bits of the final result will be 
corrupted. This is actually a generous estimate, since it does not assume any 
cancelation of errors, which would yield a lower estimate. In any event, it is clear 


+ mod 1. 
k=|n/e|+1 pl ) 
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that ordinary IEEE 64-bit arithmetic is sufficient to obtain a numerically significant 
result for even a large computation, and “quad precision” (i.e. 128-bit) arithmetic, 
if available, can insure that the final result is accurate to several digits beyond 
the one desired. One can check the significance of a computed result beginning 
at position n by also performing a computation at position n +1 or n — 1 and 
comparing the trailing digits produced. 

The most basic interesting constant. whose digits can be computed using this 
scheme is 


1 
k=1 


in base 2. Using this scheme to compute hexadecimal digits of from identity (1.2) 
is only marginally more complicated, since one can rewrite formula (1.2) using four 
sums of the required form. Details are given in the next section. In both cases, in 
order to compute the n-th binary digit (or a fixed number of binary digits at the 
n-th place) we must sum O(n) terms of the series. Each term requires O(log(n)) 
arithmetic operations and the required precision is O(log(n)) digits. This gives 
a total bit complexity of O(n log(n)M (log(n))) where M(j) is the complexity of 
multiplying j bit integers. So even with ordinary multiplication the bit complexity 
is O(nlog®(n)). 

This algorithm is, by a factor of log(log{log(n))), asymptotically slower than the 
fastest known algorithms for generating the n-th digit by generating all of the first 
n digits of log(2) or a [7]. The asymptotically fastest algorithms for all the first 
n digits known requires a Strassen-Schénhage multiplication [19]; the algorithms 
actually employed use an FFT based multiplication and are marginally slower than 
our algorithm, from a complexity point of view, for computing just the n-th digit. Of 
course this complexity analysis is totally misleading: the strength of our algorithm 
rests mostly on its easy implementation in standard precision without requiring 
FFT methods to accelerate the computation. 

It is clear that the above methods can easily be extended to evaluate digits of 
contstants defined by a formula of the form 


_ yo _lk) 
$= Deg 


where p and q are polynomials with integer coefficients and c is a positive integer. 
Similarly if p and qg are slowly growing analytic functions of various types the 
method extends. 


4, COMPUTATIONS 


We report here computations of 7, log(2), log?(2), «? and log(9/10), based 
on the formulas (1.1), (2.2), (2.5), (2.6) and the identity log(9/10) = —L,(1/10), 
respectively. 

Each of our computations employed quad precision floating-point arithmetic for 
division and sum mod 1! operations. Quad precision is supported from Fortran on 
the IBM RS6000/590 and the SGI Power Challenge (R8000), which were employed 
by the authors in these computations. We were able to avoid the usage of explicit 
quad precision in the exponentiation scheme by exploiting a hardware feature com- 
mon to these two systems, namely the 106-bit internal registers in the multiply-add 


227 


228 


14. RAPID COMPUTATION OF POLYLOGARITHMIC CONSTANTS (1997) 


910 DAVID BAILEY, PETER BORWEIN, AND SIMON PLOUFFE 


operation. This saved considerable time, because quad precision operations are 
significantly more expensive than 64-bit operations. 

Computation of ? and log?(2) presented a special challenge, because one must 
perform the exponentiation algorithm modulo k? instead of k. When 7 is larger than 
only 2!%, some terms of the series (2.5) and (2.6) must be computed with a modulus 
k? that is greater than 27°. Squares that appear in the exponentiation algorithm will 
then exceed 25°, which is nearly the maximum precision of IEEE 64-bit floating- 
point numbers. When 7 is larger than 27°, then squares in the exponentiation 
algorithm will exceed 2!°4, which is nearly the limit of quad precision. 

This difficulty can be remedied using a method which has been employed for 
example in searches for Wieferich primes [10]. Represent the running value r in 
the exponentiation algorithm by the ordered pair (r1,r2), where r =r] + kro, and 
where r; and rz are positive integers less than k. Then one can write 


r? = (ry + krg)? =r? + Qryrok + rk’. 


When this is reduced mod k”, the last term disappears. The remaining expression is 
of the required ordered pair form, provided that r? is first reduced mod k, the carry 
from this reduction is added to 2r;r2, and this sum is also reduced mod k. Note 
that this scheme can be implemented with integers of size not exceeding 2k”. Since 
the computation of r? mod k? is the key operation of the binary exponentiation 
algorithm, this means that ordinary IEEE 64-bit floating-point arithmetic can be 
used to compute the n-th hexadecimal digit of 7? or log*(2) for n up to about 
274. For larger n, we still used this basic scheme, but we employed the multiply- 
add “trick” mentioned above to avoid the need for explicit, quad precision in this 
section of code. 

Our results are given below. The first entry, for example, gives the 10°-th through 
10° +13-th hexadecimal digits of x after the “decimal” point. In all cases we did the 
calculations twice — the second calculation was similar to the first, except shifted 
back one position. Since this changes all the arithmetic performed, it is a highly 
rigorous validity check. Thus we believe that all the digits shown below are correct. 

These computations were done at NASA Ames Research Center, using worksta- 
tion cycles that otherwise would have been idle. 


5. LoGs IN BASE 2 
It is easy to compute, in base 2, the d-th binary digit of 
(5.1) log(1 — 27") = £,(1/2"). 
So it is easy to compute log(m) for any integer m that can be written as 
Pega ieee 9 ee ra ate 0 
(2: — 1)(22 —1)--- (2 —1) 
In particular the n-th cyclotomic polynomial evaluated at 2 is so computable. A 


check shows that all primes less than 19 are of this form. The beginning of this list 
is: 


(5.2) 


{2, 3,5, 7, 11, 13, 17, 31, 43, 57, 73, 127, 151, 205, 257} . 
Since 


248 _1=7-9-19-73, 
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Constant: Base: Position: Digits from Position: 


T 16 108 26C65E52CB4593 
107 17AF5863EFED8D 
108 ECB840E21926EC 
10° 85895585A0428B 
1910 921C73C6838FB2 
log(2) 16 10° 418489A9406EC9 
107 815F479E2B9102 
108 E648F40940E13E 
10° BIEEF1252297EC 
ra 16 10° 685554E1228505 
10” 9862837 ADSAABF 
108 4861A AF8F861BE 
10° 437A2BA4A13591 
log?(2) 16 108 2EC7EDB82B2DF7 
107 33374B47882B32 
108 3F55150FLAB3DC 
10° 8BA7C885CEFCES8 
log(9/10) 10 108 80174212190900 
107 21093001236414 
108 01309302330968 
10° 44066397959215 
1018 82528693381274 


and since 7, V9 and 73 are all on the above list we can compute log(19) in SC* 
from 


log(19) = log(2* — 1) — log(7) — log(9) — log(73). 


Note that 21! — 1 = 23-89 so either both log(23) and log(89) are in SC* or neither 
is. 

We would like to thank Carl Pomerance for showing that an identity of type 
(5.2) does not exist for 23. This is a consequence of the fact that each cyclotomic 
polynomial evaluated at two has a new distinct prime factor. We would also like 
to thank Robert Harley for pointing out that 29 and 37 are in SC* in base 2 via 
consideration of the Aurefeuillian factors 22°—! + 2" +1 and 2277! — 2" 41, 


6. RELATION BOUNDS 


One of the first questions that arises in the wake of the above study is whether 
there exists a scheme of this type to compute decimal digits of 7. At present 
we know of no identity like (1.2) in base 10. The chances that there is such an 
identity are dimmed by some numerical results that we have obtained using the 
PSLQ integer relation algorithm [3], [12]. These computations establish (with the 
usual provisos of computer “proofs”) that there are no identities (except for the 
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case n = 16) of the form 


ay 1 ee 1 ag a3 am+1 
es ag ga aa Fre Bra a, eae ny er : 
where n ranges from 2 to 128, where m ranges from 1 to min(n,32), and where 
the Euclidean norm of the integer vector (@9,@1,+-* ,@m41) is 10! or less. These 
results of course do not have any bearing on the possibility that there is a formula 
not of this form which permits computation of 7 in some non-binary base. 

In fact, J. P. Buhler has reported a proof that any identity for 7 of the above 


K 
form must have n = 2" orn =/2 . This also does not exclude more complicated 
formulae for the computation of 7 base 10. 


7. QUESTIONS 


As mentioned in the previous section, we cannot at present compute decimal 
digits of by our methods because we know of no identity like (1.2) in base 10. 
But it seems unlikely that it is fundamentally impossible to do so. This raises the 
following obvious problem: 

1] Find an algorithm for the n-th decimal digit of 7 in SC*. It is not even clear 
that z is in SC in base 10 but it ought to be possible to show this. 

2] Show that a is in SC in all bases. 

3] Are e and 2 in SC (SC*) in any base? 

Similarly the treatment of log is incomplete: 

4] Is log(2) in SC* in base 10? 

5] Is log(23) in SC* in base 2? 
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Synopsis: 

Ever since antiquity, mathematicians have wondered whether numbers such 
as mw are rational, the quotient of two integers. In fact, one prime motivations 
in computations of 7 through the centuries was the hope that the digits produced 
might provide some insight into this question. In 1761, Lambert settled the question 
for 7, but the irrationality of some other constants remains in question. 

In this paper, the author presents, in a single proof, irrationality proofs for 
m, In2, ¢(2) = S072, 1/k? and ¢(3) = OP. 1/k?. These proofs are surprisingly 
accessible — for the most part they involve only operations of calculus, together 
with some properties of Legendre polynomials. 
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Similarities in Irrationality Proofs 
for 7, In2, C(2), and C(3) 


Dirk Huylebrouck 


1. FOUR REMARKABLE NUMBERS. The first two numbers, 7 and In2, are fa- 
miliar to high school graduates. Their expressions as series are standard: 


1 1 1 . i 
Sects =) —1))- —— Leibniz’ serie: 
1 5 he 5 + d( } Wel (Leibniz’ series) 


re ae, 
ce penk Gye ec Ny 009 Pe z0 
nests oot y(-D ; = 0.693 


i=1 
Similar expressions define the less familiar ¢ (2) and ¢ (3): 


i od 1 1 
C(Dalt ot yt tytee = g = 164493... 


2 
2 i>] 


1 1 1 1 
and Ae Nig tga re pg tint Pg OD 
The irrationality of 2 dominated a good 2000 years of mathematical history, starting 
with the closely related circle-squaring problem of the ancient Greeks. In 1761 Lam- 
bert proved the irrationality of z (Lindemann would complete the transcendence proof 
in 1882 [2, pp. 52 and 172}). The interest in ’s younger brother ¢(3) started only a 
few centuries ago, but the number resisted until 1978, when R. Apéry presented his 
‘miraculous’ proof [14]. Even after Apéry’s lecture, scepticism remained general, un- 
til Beukers’ simplified version confirmed it [3]. The character of the ¢-numbers still 
fascinates the mathematical community, and even very recently it was upset by results 
of Tanguy Rivoal (communication J. Van Geel, University of Ghent). 


Essential in the simplified proofs are the representations of ¢(2) and ¢(3) as inte- 
grals. Since 


1 1 1 = 
dxdy = If x'y'dxdy 
iH \—xy : 0 » 


iz0 


1 t 
= ie | dx [ yidy 
0 0 


i>0 


Ss itt 1 yit! 25 1 
~ & i+1], Li+1 oe G +1)?’ 


i>0 
ts 1 
C2) = | / dxdy. 
o 1—xy 
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Similarly for ¢ (3): 


1 
[<x i Pe ls ae = i be y'Inxydxdy = aff So x'y'Inxdxdy 
Q 0 


i>d i>0 
=f x'inxdx [ yidy 
iro 70 
j ' 
= Ss (Ga ce a) Je 
mer i+ 


1 1 1 1 
= (-ma) ma = Geip 


1 1 
@=-5 ff; 


Incidentally, the integral for ¢(2) can be evaluated very easily. This computation is 
not really needed here, but it provides an illustration of a calculation involving a zeta 
expression. The easiest components of Apostol’s, Beuker’s, and Kalman’s results are 
combined, and except for some elementary knowledge about double integrals, no other 
prerequisites seem needed: see [1], [3], [10]. The combination provides a proof at 
graduate level for Euler’s ¢(2) result; see [4] for a more rigorous and general proof. 
First, rewrite the difference of two integrals using the substitution X = x? and 


7 [Eh G4e-rts)eo= ff (E2e)ew 


= dxdy. 1 
IL (se) w 
Next, obtain their sum: 


1 1 1 ! 1 
dxdy =2 soe ivy: 2 
I (<s'rs) ae II [x2 y? ee 


Adding (1) and (2) gives 


4 1 | } 1 
2 dxdy = — adXdY +2 ——.— dxdy, 
If l1—xy nee all 1— XY " If 1 —x?y? ee 


Thus, 


1 
Inxydxdy. 


2¢(2) = 5) 42 / [x ittdy. 


350) = i [<r pitty 
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Substituting x = sin@/cos@ and y = sing/cos@ yields a Jacobian equal to 
1 — tan’ 6 tan? # and this is the denominator of the integrand. 


m 


72) = ah [a wey y 5 (1 — tan’ 6 tan” y) dy 
sin sing 


4 2 ae x? 
== | a ie a, 
4] if ak tame 


This is Euler’s well-known result: ¢(2) = 17/6. 


2. FOUR PROOFS IN ONE. The Borweins collected irrationality proofs for these 
four numbers [5, pp. 353, 366, 369, and 370}, in a very rigorous treatise. In order to 


make their similarity more evident, we first summarise the highlights of the demon- 
strations in general terms. 


Suppose the irrationality of a number € must be shown. In the four cases we present 
here, a family of integrals (j € N) concerning that number is proposed: 


1 
i x! f(x)dx = RK; + Sjé, 
0 


where R; and S; € Q; f is an unknown function. 
Now if € were a fraction a/b, this family of integrals would yield rational expres- 


sions f x! f(x)dx = C;/D;, where C; and D; € Z. Integer multiples of these inte- 
grals and their sums would again exhibit this property. Thus, if p,; € Zand n € N: 


1 on : n E, 
[ YS prix! f @)dx = Yow fi xf f(x)dx = yg D; = S 5 
o = j=0 


j=0 Fn 


with again E, and F, € Z. 
Apply this property to the Legendre polynomials: 


P(x) = 


= Prix! 
j=0 


For example, Po(x) = 0, P\(x) = 1 ~— 2x, P?(x) = 2 — 12x — 12x?. Note that 
Prj EZ, 
Thus, for the given family of integrals, 


! A 
i Pr) ff O)dx = =, 
i) Br 
with A,, B, € Z. 
The choice of Legendre polynomials is inspired by the possibility of performing 


integrations by parts easily. Indeed, since P,(0) = 0 = P,,(1), and similarly for the 
derivatives of x"(1 — x)" up to the order n, many terms can be simplified: 


1 I ld d*- 1 , 
| Py(x) f()dx = [ — ( Soa el 0" ) fordx 
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1 n—1 1 


d n n 
= Freee (i ~ x) » £00] 


0 


lida |, af (x) 
D Rigede Ne 


1 n—-1 
-0- | bd (x" (1 — x)") aI) 
dx 


9 nidxr! 


Integrating by parts n times leads to 


1 in 
[ aro-o a. 
o n! dx" 


In the four encountered cases the function f(x) happens to be such that 
My d" ry 
[ o0- xp as| [ peor mena, 
om dx" io on! 
where the maximum value M of g(x) is small enough to ensure that 


> 0, 


1 
B,M" : h(x)dx 
0 


In addition, all the integrals 4 P,,(x) f («)dx are non-zero, and this immediately im- 
plies the irrationality of &: 


0<|A,| = < +0. 


1 
By i P(x) f (xx 


1 
B,M" fe h(x)dx 
0 


Indeed, for any n € N, |A,| is a positive integer so this is impossible; € cannot be 
rational. 


Of course, the difficulty in the proofs lies in the appropriate choice of f(x). It 
must yield a family of non-zero integrals whose members are easily expressed as a 
combination rational number and &. In addition, it should simultaneously be possible 
to maximize their product with an integer that becomes larger and larger, and still 
ensure the indicated convergence to 0. 


3-I. Irrationality of 2 Take f(x) = sin(@rx). It is a standard calculus exercise 
to show that the members of family of integrals of the form f xf sin(rx)dx are 
polynomials in a of degree at most j, divided by z/. The linear combinations 
i P.,(x) sin(x)dx are non-zero, and thus, if 2 were the rational number a/b: 


0 < |A,| = 


1 
a | P,,(x) sin(ax)dx 
0 


n 


nA : 1 “zal n d - d 
a : Pa — x) Gyn SX) x 


s 


1 
1 
a | ae —x)"n"dx\, 
0 : 


since the n-th order derivative of sin(zx) is kz” sin(rx) or 7” cos(zx). The max- 
imum value of x(1 — x) is 1/4, attained at x = 1/2. Thus, the final expression is less 
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nt (1\" ny 
ala) OP 


which is arbitrary small for large values of n; see [13]. 


than 


3-If. frrationality of In2 The choice is f(x) = 1/(1 +x). An Euclidian division of 
x/ by 1+ allows us to compute the family of integrals 


1x 1 1 
dx =~ — ~~ +--+ 1412. 
o l+x iojr-l 


If lIn2 were a/b, then 


1 
0 < |A,| = Koa) Pa) 4 (where d,, = LCM{1, 2,3,...,}) 


n a a” 1 
=|ouy [2 —x a —x) mp G3) as| 
xd — 1 
sla f Gesx 1+x >) at. 


since the n-th order derivative of 1/(1 + x) is (—1)...(—n)(1/(1 + x))"*!. Now on 
(0, 1}, the maximum value of x(1 — x)/(1 + x) is 3 — 2/2, achieved atx = —1 + J/2. 
A rough inequality from number theory is d, = LCM{1,...,n} < 3". Finally, since 
(3(3 — 2./2))” < 1, the irrationality of In2 is established; see (5, p. 370]. 


3-III. Irrationality of ¢(2) The choice is 
1 i ) 
f= fa 
0 — xy 
Each member of the family of integrals 
1 : 1 (a= n 
i, x ier ay| dx 
0 o 1—xy 


is a sum of integrals of the form 


1 ras 
I ** aydx, withr, s EN. 
9 L-xy 


These can again be computed through an Euclidian division of x/ y* by 1 — xy, which 
gives a sum of integrals of the form 


1 1 xP 1 y4 1 
If xP yldydx, If dydx, | ————dydx or i 
0 go l~xy 9 L-xy o | 


The latter is ¢(2), while the others are sums of fractions, which can be computed using 
partial integration for the integral f x"Inxdx. Thus when r # 5, 


226 © THE MATHEMATICAL ASSOCIATION OF AMERICA [Monthly 108 


239 


240 15. SIMILARITIES IN IRRATIONALITY PROOFS FOR 7, In2, ¢(2), AND ¢(3) (2001) 


i i saya 


is a sum of fractions whose common denominator is the square of least common mul- 
tiple (LCM) of the first 7 + 1 integers. When 7 = s, the integral equals 


1 \ 
Da at@-(1e--+ 5). 


i>r 


Thus, 
1 
Ap 
P(x) f a)dx| = , > , ’ 
0 div 
where d,,1 is the LCM of the first n + 1 natural numbers and A, € Zo. 
Now 
1 
0< [dnl = ld [Pad fdas 
0 


| qd" ‘Taq-yy 
= |d* —x"(1— x)" dy}d 
a, [ Sera Fs (fay) ax 


1 = n 
= aif = —<2"(L a)" 1) sae mf Jt 2 dvds 
0 


—xy)rt 


" x(1—x)yQ =») 1 | 
= ld —— 3 
iA ( (— xy) er ged " 


On (0, 1], the maximum value of x(1 — x)y(1 — y)/(1 — xy) is (—1 + V5) /2)%, and 
is attained for x = y = (~-1+ /5) /2. Together with d,4, < 3"*!, this shows that the 
expression (3) is less than 


Seg (PREY) 
(3 | (4 )} yo 


which establishes the irrationality of ¢(2); see [3}. 


9(-1 +03) 
95 


= 19 £(2)| < 1, 


3~IV. Irrationality of ¢(3) Take 


soy= fi ee Inxy dxdy. 


The members of the family of integrals 


1 
[ “(fo Faly) > tnxy dey ax 
0 0 


are computed through the derivative of 
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Lopr+tt yet 
xy 
dxdy 
o l-xy 
1 yrttyjstt 
xy 
Inxy dxdy. 
90 1—xy 


Ifr % s, this is asum of fractions since d(r + t)""/dt = —m/(r + t)~”~!, and the 
LCM of the denominators is (d,,)°. 
When r = 5s, the result is 


Se dt eaFe gS ren ma 


i>r+t t>r+t 
1 1 1 
=2(c0- (1+ Tat ey ee) 


|An| 


(al n+] 


with respect to t, which is 


Thus, 


? 


if Pi(x)f @)dx| = 


where A, € Z and d,,1 is the LCM of the first n + 1 natural numbers. 


Now 
1 
[An] = teh P,(x) if PnQ) PX insyay| as 
0 0 _ xy 


rst / | i: ae D axdyde (integration by parts) 
x"(1 — x)" Paly)y"z _ l-z 
“elit G= Cay “Gata ayey eae (pet ~1=0 ye, 
= dns / If ss et ge dxdydw (integration by parts) 
—(—xy 


wd —xyry"(— yyw" — wy" 
die fit Gd —(—xyywyt es aw 


eae een i | 
~ slacfif ( 1—(—xyyw iam 


The maximum value of x(1 — x)y(1 — y)w(1 — w)/(i — (i — xy)w) on [0, 1] is 
(/2 — 1)4, and is attained for x = y = —1 + ¥2 and z = 1/¥2. Together with 
Gn41 <3", this shows that the expression (4) is less than 


ry / / i (v2- 1)" vt = |o7 (27 (v3 = i) | 


<1, 


which establishes the irrationality of ¢(3), see [3]. 
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4. THE GOLDEN SECTION. In our proofs, the maximum values of certain func- 
tions play an important role. They were attained at the points 1/2, (~1 + V5)/2, and 
—1+ ./2. There is a coincidental link since they are all related to the golden section. 

Classically, the golden section $ arises when a line segment of length x greater than 
1 is divided into two parts. This could be done by cutting it into two halves (recall that 
1/2 popped up in the irrationality proof of zr). Or, if unequal segments are desired, 
one could look for two pieces of lengths 1 and x — 1, such that the ratio x/1 equals 
the ratio 1/(x — 1). This equality produces the quadratic equation x” — x — 1 = 0, of 
which 1.6180... = @ is the positive solution. 

More generally, the positive roots of x? — nx — 1 = 0 yield the family of metallic 
means, for various values of n € N. For n = 2, we get the silver mean 04, = 1+ V2, 
for n = 3 the bronze mean op, = (3 + V13)/2, etc. The properties of these numbers 
have been described in numerous publications; for a comprehensive survey, see [6], 
while [8} and [12] pointed out that some authors often had too much enthusiasm. A 
common misconception is that a rectangle of width 1 and length @ would be the “most 
elegant” one and thus is used in various designs. However, no reliable statistical studies 
confirm this statement about the optimal choice provided by the golden number [77]. 

A statement that comes close is the fact that @ would be “the most irrational of all 
irrational numbers” because its representation as a continued fraction contains only Ls: 


1 
i+— 


g=1+ =i, lscd, 


The silver mean would be “the second most irrational number” since o4, = [2,2,...], 
etc.; see [6]. Yet, this again does not provide an interpretation of the golden section an 
optimal solution, in the standard mathematical sense. 

However, the various irrationality proofs lead to a property of these metallic means 
where the expression “optimal solution” has its common mathematical meaning. 
Table 1 illustrates some interesting facts: 


TABLE 1. 
Proof Function used Maximum Attained at Name 
ae ae -! 
ca x(1 — x) x= Z x= 5 2 
ny 205” 3-2/2 x=-1+2 Ag 
(ra) 7 
5 
2) x(1 ~ x)y( — y) ea +45) = (-1+¥5) -¢" 
gC “aoa et ee 5) 
x(1—x)y(1 — y)w(1 — w) ‘i ee te acid 
é(3) = (ibe. 7 (-1+ v2) x=y= 1+V2 z= Fag 


A substitution of X = —1/x and ¥Y = —1/yinx(1 —x)y(1 — y)/(1 — xy) changes 
the expression into (1 + x)(1 + y)/(xy(vy — 1)). Its extremum is obtained at X = 
Y = ¢. Similarly, o4, provides the optimal solution to (X — 1)/((X + 1)X). In these 
cases, the word “optimal” is used in the usual mathematical way, in contrast to the 
loose terms often used in golden section papers. The geometric interpretation of these 
facts is developed in a forthcoming text [9]. 
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There are other links between ¢(2) and ¢(3), and the metallic means. For example, 
in the easy proof for Euler’s £(2) result, hyperbolic sines and cosines (x = sinh@/ 
cosh @ and y = sinh @/ cosh) can be substituted instead of the similar expressions 
with circular sines and cosines. In that case, 


+00 


£(2) = (in (i ie v2) +2 | (¢ — argcosh (sinh t)) dt. 
in(1+-/2) 


Now the silver section 1 + /2 appears, while in Table 1, ¢ (2) was already linked to the 
golden section. More involved computations relate ¢(3) to the golden section, too (11, 
p. 156]: 


inc Hy) 
¢(3) = 10 [ t? cosh tdt. 
0 


These relations did not inform us about the “optimal” properties of the metallic num- 
bers, and we give them here for sake of completeness. 
Incidentally, since 


= (1 — xy) eae ee 
c= ff{ff Gad-mmd=demn ee 


a natural generalization of the functions used in the study of the irrationality of ¢(2) 
and £(3) would be 


x(1—x)y — yyw — wu — 0) — xy) 
(1-1 — xy)w) — (1 — xy)v) 


The maximum of this function is 


(5- v3) (-7+ 2vi3) 
54(-3 + /13)4 


obtained for z = w = (1 — /G@y))/(1 — xy) and x = y = (-3 + V/13)/2. Here 
again a metallic mean is found, and it is the next one, —oz,~'. Unfortunately, it does 
not provide a proof for the irrationality of ¢(4) (and by extension for ¢(5)) since the 
members of the family of integrals are not combinations of ¢ (4) with rational numbers. 
The quest for these proofs remains open. 
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Solution to Sherwood Forest Puzzle on p. 143 of the February issue: 


The idea of ‘singer of a person’s song to another’ can be taken as an operation *. 
that is, “x * y = 2” denotes “x’s song is sung to y by z”. Thus: 


1) we have closure of « (everyone’s song is sung to everyone by a singer), 
2) we have an ‘identity’ (the priest), 
3) we have ‘inverse’ for everyone (mates), 


4) the (unavoidably) cryptic third paragraph is the associativity of the opera- 
tion *. 


Thus we have a group structure. But, 10,201 = 101 x 101 and 101 is a prime, 
and any group of order of square of a prime is abelian. 

Thus, it was Marian who sang Little John’s song to Robin. For the second 
question, it suffices that we have a group structure since we have Marian = 
Robin * Little John. We should recall Marian and Robin are mates and so we 
can left multiply the equation by Robin’s inverse to get Marian * Marian = 
Little John. And so Little John sang to Marian her song. 
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Paper 16: Jeremy Gibbons, “Unbounded spigot algorithms for the digits of pi,” 
American Mathematical Monthly, vol. 113 (2006), p. 318-328. Copyright 2006 
Mathematical Association of America. All Rights Reserved. 


Synopsis: 

In 1995, Rabinowitz and Wagon presented a “spigot” algorithm for 7, in a 
paper included earlier in this volume. One limitation of the Rabinowitz-Wagon 
algorithm was that the computation was inherently bounded — one has to decide 
in advance that one will compute up to a certain number of digits. 

In this paper, Gibbons presents a different spigot algorithm, based on the same 
infinite series that lies behind the Rabinowitz-Wagon algorithm, but which avoids 
this limitation. One does not need to commit in advance to compute a certain 
maximum number of digits. In theory, the algorithm could continue to gener- 
ate decimal digits of 7 indefinitely. Gibbon’s algorithm is not as simple as the 
Rabinowitz-Wagon algorithm, but it still can be stated (and implemented) very 
concisely. 


Keywords: Algorithms 
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Unbounded Spigot Algorithms 
for the Digits of Pi 


Jeremy Gibbons 


1. INTRODUCTION. Rabinowitz and Wagon [8] present a “remarkable” algorithm 
for computing the decimal digits of 2, based on the expansion 


= Oo (@72!4! 
we ys Qi+ DE “ 


Their algorithm uses only bounded integer arithmetic and is surprisingly efficient. 
Moreover, it admits extremely concise implementations. Witness, for example, the 
following (deliberately obfuscated) C program due to Dik Winter and Achim Flam- 
menkamp [I, p. 37], which produces the first 15,000 decimal digits of z: 


a[52514] ,b,c=52514,d,e,f=1e4,¢,h; 
main(){for(;b=c-=14;h=printf("%04d", e+d/f)) 
for (e=d,=f ; g=--b*2;d/=g) d=d*btf* (h?alb] :£/5) ,alb]=d%--g;} 


Rabinowitz and Wagon call their algorithm a spigot algorithm, because it yiclds 
digits incrementally and does not reuse digits after they have been computed. The 
digits drip out one by one, as if from a leaky tap. In contrast, most algorithms for 
computing the digits of x execute inscrutably, delivering no output until the whole 
computation is completed. 

However, the Rabinowitz-Wagon algorithm has its weaknesses. In particular, the 
computation is inherently bounded: one has to commit in advance to computing a 
certain number of digits. Based on this commitment, the computation proceeds on an 
appropriate finite prefix of the infinite series (1). In fact, it is essentially impossible 
to determine in advance how big that finite prefix should be for a given number of 
digits—specifically, a computation that terminates with nines for the last few digits of 
the output is inconclusive, because there may be a “carry” from the first few truncated 
terms. Rabinowitz and Wagon suggest that “in practice, one might ask for, say, six 
extra digits, reducing the odds of this problem to one in a million” [8, p. 197], a not 
entirely satisfactory recommendation. Indeed, the implementation printed at the end 
of their paper is not quite right [1, p. 82}, sometimes printing an incorrect last digit 
because the finite approximation of the infinite series is one term too short. 

We propose a different algorithm, based on the same series (1) for 7 but avoiding 
these problems. We also show the same technique applied to other characterizations 
of x. No commitment need be made in advance to the number of digits to be com- 
puted; given enough memory, the programs will generate digits ad infinitum. Once 
more (necessarily, in fact, given the previous property), the programs are spigot algo- 
rithms ijn Rabinowitz and Wagon’s sense: they yield digits incrementally and do not 
reuse them after producing them. Of course, no algorithm using a bounded amount 
of memory can generate a nonrepeating sequence such as the digits of w indefinitely, 
so we have to allow arbitrary-precision arithmetic, or some other manifestation of dy- 
namic memory allocation. Like Rabinowitz and Wagon’s algorithm, our proposals are 
nol competitive with state-of-the-art arithmetic-geometric mean algorithms for com- 
puting wz [2], [9]. Nevertheless, our algorithms are simple to understand and admit 
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almost as concise an implementation. As evidence to support the second claim, here 


is a (deliberately obscure) program that will generate as many digits of 7 as memory 
will allow: 


> pi = g(1,0,1,1,3,3) where 

> g(q,r,t,k,n,1) = if 4*qtr-t<n+t 

> then n : g(10#q,10*(r-n#t) ,t,k,div(10*(3*q+r) )t-10*n,1) 

> else g(q*k, (Qeqtr)*1,t*1,k+1,div(q* (7*k+2)+r*1) (t*1) ,1+2) 


The remainder of this paper provides a justification for the foregoing program, and 
some others like it. 

These algorithms exhibit a pattern that we call streaming [3]. Informally, a stream- 
ing algorithm consumes a (potentially infinite) sequence of inputs and gencrates a 
(possibly infinite) sequence of outputs, maintaining some state as it goes. Based on the 
current state, at cach step there is a choice between producing an element of the out- 
put and consuming an element of the input. Streaming seems to be a common pattern 
for various kinds of representation changers, including several data compression and 
number conversion algorithms. 

The program under discussion is written in Haskell [5], a lazy functional program- 
ming language. As a secondary point of this paper, we hope to convince the reader 
that such languages are excellent vehicles for expressing mathematical computations, 
certainly when compared with other general-purpose programming languages such as 
Java, C, and Pascal, and arguably even when compared with computer algebra systems 
such as Mathematica. In particular, a lazy language allows direct computations with in- 
finite data structures, which require some kind of indirect representation in most other 
languages. The Haskell program presented earlicr has been compressed to compete 
with the C program for conciseness, so we do not argue that this particular one is easy 
to follow—but we do claim that the later Haskell programs are. 


2. LAZY FUNCTIONAL PROGRAMMING IN HASKELL. To aid the reader’s 
understanding, we start with a brief (and necessarily incomplete) description of the 
concepts of functional programming (henceforth FP) and laziness and their manifesta- 
tion in Haskell [5], the de facto standard lazy FP language. Further resources, including 
pointers to tutorials and free implementations for many platforms, can be found at the 
Haskell website [6]. 

FP is programming with expressions rather than statements. Everything is a value, 
and there are no assignments or other state-changing commands. Therefore, a pure FP 
language is referentially transparent: an expression may always be substituted for one 
with an equal value, without changing the meaning of the surrounding context. This 
makes reasoning in FP languages just like reasoning in high-school algebra. 

Here is a simple Haskel) program: 


> square :: Integer -> Integer 
> square x = x * xX 


Program text is marked with a “>” in the left-hand column, and comments are un- 
marked. This is a simple form of literate programming, in which the emphasis is 
placed on making the program easy for people rather than computers to read. Code 
and documentation are freely interspersed; indeed, the manuscript for this article is 
simultaneously an executable Haskell program. 

The first line in the square program is a type declaration; the symbol “: :” should 
be read “has type.” Thus, square has type Integer -> Integer, that is, it is a 
function from Integers to Integers. (Type declarations in Haskell are nearly always 
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optional, because they can be inferred, but we often specify them anyway for clarity.) 
The second line gives a definition, as an equation: in any context, a subexpression of 
the form square x for any x may be replaced safely with x * x. 

Lists are central to FP. Haskell uses square brackets for lists, so [1,2,3] has 
three elements, and [] is the empty list. The operator “:” prefixes an element; 
so [1,2,3] = 1: (2: (3: [))). For any type a, there is a corresponding 
type [a] of lists with elements drawn from type a, so, for example. [1,2,3] 
[Integer]. Haskell has a very convenient list comprehension notation, analogous 
lo sct comprehensions; for instance, [ square x | x <- [1,2,3] ] denotes the 
list of squares [1,4,91]. 

For the “substitulion of equals for equals” property to be universally valid, it is 
important not to evaluate expressions unless their values are needed. For example, 
consider the following Haskell program: 


> three :: Integer -> Integer 
> three x = 3 


> nonsense :: Integer 
> nonsense = 1 + nonsense 


The first definition is of a function that ignores its argument x and always returns 
the integer 3; the second is of a value of type Integer, but one whose evaluation 
never terminates. For substitutivity to hold, and in particular for three nonsense to 
evaluate to 3 as the equation suggests, it is important not to evaluate the function argu- 
ment nonsense in the function application three nonsense. With lazy evaluation, 
in which evaluation is demand-driven, no expression is evaluated unless and until its 
value is needed to make further progress. 

A useful by-product of lazy evaluation is the ability to handle infinite data struc- 
tures: they are evaluated only as far as is necessary. We illustrate this with a definition 
of the infinite sequence of Fibonacci numbers: 


> fibs :: [Integer] 
> fibs = f (0,1) where f (a,b) =a: f (b,atb) 


(Here, (0,1) is a pair of Integers, and the where clause introduces a local defini- 
tion.) Evaluating fibs never terminates, of course, but computing a finite prefix of it 
does. For cxample, with 


> take :: Integer -> [Integer] -> [Integer] 
> take 0 xs = [] 

> take (nti) 0 = O 

> take (nti) (x:xs) = x : take n xs 


(so take takes two arguments, an Integer and a [Integer], and returns another 
[Integer]), we have take 10 fibs = [0,1,1,2,3,5,8,13,21,34], which 
terminates normally. 

Functions may be polymorphic, defined for arbitrary types. Thus, the function take 
in fact works for lists of any element type, since elements are merely copied and 
not further analyzed. The most general type assignable to take is Integer -> [a] 
-> [a] for an arbitrary type a. 

Because this is FP, functions are “first-class citizens” of the language, with all the 
rights of any other type. Among other things, they may be passed as arguments to and 
returned as results from higher-order functions. For example, with the definition 
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> map :: (a->b) -> [a] -> [b] 
>mapf ) = 0 
> map f (x:xs) = fx : map f xs 


the list comprehension [ square x ] x <- [1,2,3] ] is equal tomap square 
{1,2,3]. Note that map and take are curried. In fact, the function type former -> 
associates to the right, so the type of map is equivalent to (a->b) -> ([a]->[b]), 
which one might read as saying that map takes a function of type a->b and transforms 
it into one of type [a] ->[b]. 


3. RABINOWITZ AND WAGON’S SPIGOT ALGORITHM. Rabinowitz and 
Wagon’s algorithm is based on the series (1) for 7, which expands out to the expres- 


T ssseifebslobess())))» 


This expression for z can be derived from the well-known Leibniz series 


mes (=D! 
rie Sram io 


using Euler’s convergence-accelerating transform, among several other methods [7]. 
One can view expression (2) as representing a number (2; 2, 2,2, ...) ina mixed-radix 
base B = (4, 2, 3,...), in the same way that the usual decimal expansion 


| i 
T23e—(1— 442 
: agitate (tt gst ))) 


represents (3; 1,4, 1,5, ...) in the fixed-radix base Fjo, where F,, = (4,4, 4,...). 
The task of computing the decimal digits of m is then simply a matter of converting 
from base B to base Fio. 

We consider regu/ar representations. For a regular representation in decimal, every 
digit after the decimal point is in the range [0, 9}. The decimal number (0; 9,9, 9, ...) 
with a zero before the point and maximal digits afterwards represents 1. Accordingly, 
regular decimal represcntations with a zero before the point lic between 0 and 1. 
By analogy, for a regular representation in base 5, the digit in position 7 after the 
point (the first after the point being in position 1) is in the range [0, 22]. The number 
(0; 2, 4, 6, ...) with zero before the point and maximal digits afterwards represents 2. 
Accordingly, regular base & representations with zero before the point lie between 0 
and 2. (We call the representations “regular” rather than “normal” because they are not 
unique.) 

Conversion from base B to decimal proceeds as one might expect. The integer part 
of the input becomes the integer part of the output. The fractional part of the input 
is multiplied by ten; the integer part of this becomes the first output digit after the 
decimal point, and the fractional part is retained. This is again multiplied by ten; the 
integer part of this becomes the second output digit after the point; and so on. 

Multiplying a number in base 6 by ten is achieved simply by multiplying each 
digit by ten. However, that yields an irregular result, because some of the resulting 
digits may be too big. Regularization proceeds from right to left, reducing cach digit 
as necessary and propagating any carry leftwards. 
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The only remaining problem is in computing the integer part of a number (ap; a), 
a>, a3) in base B. This is either ag or ag + 1, depending on whether the remainder 
(0; a), @2, a3) is in [0, 1) or [1, 2). Un principle, the remainder could equal 2; but, in 
practice, this cannot happen in the computation of an irrational such as 7.) Therefore, 
Rabinowitz and Wagon’s algorithm temporarily buffers any nines that are produced, 
until it is clear whether or not there could be a carry that would invalidate them. 

This whole conversion is performed on a finite number (2; 2,2,2,...,2) in 
base B—necessarily, as regularization proceeds from right to Jeft. Rabinowitz and 
Wagon provide a bound on the number of base 6 digits needed to yield a given num- 
ber of decimal digits. In fact, as mentioned earlier, they underestimate by one in some 
cases: |10n/3] digits is usually sufficient, but sometimes |10”n/3| + 1 input digits 
is necessary (and sufficient) for n decimal digits. (Here, “|x|” denotes the greatest 
integer not larger than x.) Again, as noted, this does not mean that those n decimal 
digits are all correct digits of 7, only that the n-digit decimal number produced is 
within 5 x 107” of the desired result. 


4. STREAMING ALGORITHMS. We turn now to streaming algorithms, by way of 
a simpler example than computing the digits of z. Consider the problem of converting 
a fraction in the mterval [0, 1] from one base to another. We represent fractions as 
digit sequences, and for simplicity (but without loss of generality) we consider only 
infinite sequences. For this reason, we cannot consume all the input before producing 
any output; we must alternate between consumption and production. The computation 
will therefore maintain some state depending on the inputs consumed and the outputs 
produced thus far. Based on that state, it will either produce another output, if that is 
possible given the available information, or consume another input if it is not. This 
pattern is captured by the following higher-order function: 


> stream :: (b->c) -> (b->c->Bool) -> (b->c->b) -> (b->a->b) 
> -> b -> [a] -> [c] 

> stream next safe prod cons z (x:xs) 

> = if safe zy 

> then y : stream next safe prod cons (prod z y) (x:xs) 

> else stream next safe prod cons (cons z x) xs 

> where y = next z 


This defines a function stream taking six arguments. The result of applying stream 
is an infinite list of output terms, each of type c. The last argument x: xs is a list of 
input terms, each of type a; the first element or “head” is x, and the infinite remainder 
or “tail” is xs. The penultimate argument z is the state, of type b. The other four 
arguments (next of type b->c, safe of type b->c->Bool, prod of type b->c->b, 
and cons of type b->a->b) are all functions. From the state z the function produces 
a provisional output term y = next z of type c. If y is safe to commit to from the 
current state z (whatever input terms may come next), then it is produced, and the state 
adjusted accordingly using prod; otherwise, the next term x of the input is consumed 
into the state. This process continues indefinitely: the inpul is assumed never to run 
out, and, if the process is productive, the output never terminates. 

In the case of conversion from an infinite digit sequence in base F,, to an infinite 
sequence in base F,,, clearly both the input and output elements are of type Integer. 
The state maintained is a pair (4, v) of Rationals, satisfying the “invariant” (that 
is, a property that is established before a loop commences and is maintained by each 
iteration of that loop) that the original input 
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when é input terms Xo, x,,...,%;-, have been consumed and j output terms yo, y), 
..+, ¥y-1 have been produced, Initially 7 and j are zero, so the invariant is established 
with uw = O and v = 1. In order to maintain the invariant, the state (uv, v) should be 
transformed to (u — y/(m v), av) when producing an additional output term y and to 
(x +um, v/m) when consuming an additional input term x. The value of the remain- 
ing input 


ranges between 0 and 1, so the next output term is determined provided that avu and 
nu(u + 1) have the same integer part or floor. This justifies the folowing streaming 
algorithm: 


> convert :: (Integer,Integer) -> [Integer] -> [Integer] 
> convert (m,n) xs = stream next safe prod cons init xs 
> where 


> init = (O%1, 1%1) 
> next (u,v) = floor (u*v*n’) 
> 


safe (u,v) y = (y == floor ((ut1)*v#n’)) 

prod (u,v) y = (u - fromInteger y/(v*n’), v*n’) 
> cons (u,v) x = (fromInteger x + u*m’, v/m’) 
> (m? jn’) = (fromInteger m, fromInteger n) 


(Here, “%” constructs a Rational from two Integers, “==” is the comparison oper- 
ator, and the function fromInteger cocrces Integers to Rationals.) 

For example, 1/e is 0.1002210112... in base 3 and 0.2401164352... in base 7. 
Therefore, applying the function convert (3,7) tothe infinite list [1,0,0,2,2,1, 
0,1,1,2... should yield the infinite list [2,4,0,1,1,6,4,3,5,2.... The first 
few states through which execution of this conversion proceeds are illustrated in the 
following table: 


input | I 0 0 2 2 1 

sate |S) DL 21 819 2 Ae 7 ae 0 a O38 
] 123. Oe OR Fe OT 7? 81 7? 243 7° 729 49" 729 49" 729 

output 2 4 0 


The middle row shows consecutive values of the state (#, v). The upper row shows 
input digits consumed, above the state resulting from their consumption. The lower 
row shows output digits produced, below the state resulting from their production. 
Notice that outputs are produced precisely when the corresponding state (u,v), which 
is in the previous column, is safe; that is, when |7uv| = |7(u + lv. For example, 
the fourth state (, +) is safe, because [7 x 2 x =| = 2 = [TX G +1)*x 5]. 
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This paper is not the place to make a more formal justification for the correctness 
of this program, although it is not hard to establish from the invariant stated. Neverthe- 
less, it is possible to derive the streaming program from a specification expressed in 
terms of independent operations for expanding and collapsing digit sequences, using a 
general theory of such algorithms [3]. (In the general case, either the input or the out- 
put or both may be finite. We have stuck to the simple case of necessarily-infinite lists 
here, because that is all that is needed for computing the digits of 7.) We have found 
this pattern of computation in numerous problems concerning changes of data repre- 
sentation, of which conversions between number formats are a representative example. 
Consequently, we have been calling such algorithms metamorphisms. 


5. ASTREAMING ALGORITHM FOR THE DIGITS OF z. The main problem 
with Rabinowitz, and Wagon’s spigot algorithm is that it is bounded: one must make 
a commitment in advance to the number of terms of the serics (1) to use. This com- 
mitment arises because the process of regularizing a number in base B proceeds from 
right to left, hence works only for finite numbers in that base. 

It turns oul that there is a rather simple streaming algorithm for regularizing infinite 
numbers in base 3. This means that we can make Rabinowitz and Wagon’s algorithm 
unbounded: there is no longer any need to make a prior commitment to a particular 
finite prefix of the expansion of 2. However, we will not say any more about this ap- 
proach, for there is a more direct way of computing the digits of x from the expression 
(2), to which we now turn. 

One can vicw the expansion (2) as the composition 


= (245 )Q2+3 ee ) 2+ ). (4) 
a a 3X) ( 7* ma era ae 


of an infinite series of finear fractional transformations or Mobius transformations. 
These are functions taking x to (gx + r)/(sx +1) for integers g, r, s, and ¢ with 
qgt—rs ~ 0—that ts, yielding a ratio of integer-coefficient linear transformations of 
x. Such a transformation can be represented by the four coefficients g, r,s, and 7, and 
if they are arranged as a matrix (? A then function composition corresponds to matrix 
multiplication. 


> type LFT = (Integer, Integer, Integer, Integer) 


> extr :: LFT -> Integer -> Rational 

> extr (q,r,s,t) x = ((fromInteger q) * x + (fromInteger r)) / 
> ((fromInteger s) * x + (fromInteger t)) 

> unit :: LFT 

> unit = (1,0,0,1) 

> comp :: LFT -> LFT -> LFT 

> comp (q,r,s,t) (u,v,w,x) = (qrutrew, qevtr*x, stutt*w, SkV+t*x) 


(The first line introduces the abbreviation LFT for the type of four-tuples of Integers.) 

The infinite composition of transformations in (4) converges, in the following sense. 
Although the products of finite prefixes of the composition have coefficients that grow 
without bound, the transformations represented by these products map the interval 
{3, 4] onto converging subintervals of itself. (This is easy to see, as each term is a 
monotonic transformation, reduces the width of an interval by at least a factor of two, 
and maps [3, 4] onto a subinterval of itself. Indeed, the same also holds for any tail of 
the infinite composition.) 
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Therefore, equation (4) can be thought of as the representation of some real num- 
ber, and computing the decimal digits of this number is a change of representation, 
effectable by a streaming algorithm. The streaming process maintains as its state an 
additional linear fractional transformation, representing the required function from the 
inputs yet to be consumed to the outputs yet to be produced. This state is initially 
the identity matrix; consumption of another input term is matrix multiplication; pro- 
duction of a digit m is multiplication by ({)) ~(°"), the inverse of the linear fractional 
transformation taking x ton + 75. If the current state is the transformation z, then the 
next digit to be produced lies somewhere in the image under z of the interval [3, 4]; 
if the two endpoints of this image have the same integer part, then that next digit is 
completely determined and it is safe to commit to it. 


> pi = stream next safe prod cons init lfts where 
> init = unit 


> <ilfts = [(k, 4*k+2, 0, 2ekt+1) | k<-[4..]] 
> next z = floor (extr z 3) 

> safe zn = (n == floor (extr z 4)) 

> prodzn = comp (10, -10#n, 0, 1) z 

> 


cons ZZ’ = comp Z 2’ 


The definition of 1£ts uses a list comprehension, with generator [1..], the infinite 
list of Integers from 1 upwards. The list consists of the expression (k, 4*k+2, 0, 
2*k+1) to the left of the vertical bar, evaluated for each value of k from 1 upwards. 
The first few terms are 


(0 3)-(o 3 )-Co 7) -} 


For example, the first term t =) represents the transformation taking x to 


lxx+6 
Oxx+3? 


or 2+ dy, 

The condensed program shown in section 1 can be obtained from this program by 
making various simple optimizations. These include: unfolding intermediate defini- 
tions; exploiting the invariant that the bottom left element s of every linear fractional 
transformation (7 ") be 0; constructing the input transformations in place; represent- 
ing the sequence of remaining transformations simply by the index k; and simplifying 
away one of the divisions. 

Another optimization that can be performed is the elimination of any factors com- 
mon to all four entries resulting from a matrix multiplication. This optimization is 
valid, since linear fractional transformations are invariant under scaling of the matrix, 
and helpful, as it keeps the numbers small. (In fact, it is better still to perform this 
cancellation less frequently than every iteration.) 


6. MORE STREAMING ALGORITHMS FOR THE DIGITS OF a. The expres- 
sion (2) turns out not to be a very efficient one for computation. Each term shrinks the 
range by a factor of about a half, so more than three terms are required on average 
for every digit of the output. Better sequences are known; the book 2 Unleashed [1] 
presents many. We conclude this paper with two more applications of the streaming 
technique from section 4 to computing 7, using the same approach but based on two 
of these different expressions. 
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Lambert’s expression. Herc is a more efficient expression for 7, due to Lambert in 
1770 [1, eq. (16.99)), that yields two decimal digits for every three terms: 

4 
x= 2 : (5) 
1+ BR 
3+ 2 
5+ 
SAE coc 


Again, one can view this as an infinite composition of linear fractional transformations: 
m= (4+) + 7+)34 22/543? +)---Qi-1t+is)--. 
After consuming i terms of the input, the remaining terms represent the composition 
(2§~14i7+)28+14+041P)2i434042)s)--, 


which denotes a value in the range [2/ — 1, 2 — | + 4]. As before, we subject this 
infinite sequence to a streaming process. This time, however, we maintain a state con- 
sisting of not just a linear fractional transformation (4 7}, but also the number i of 
terms consumed thus far (needed in order to determine the next digit to produce, which 
lies between (4 ') (2i — 1) and (¢ ') (2i - 1+ $)). 

This reasoning justifies the following program: 


> piL = stream next safe prod cons init lfts where 


> init = ((0,4,1,0), 1) 
> <1fts = [(2*i-1, iti, 1, ©) | i<x-[4..]] 
> next ((q,r,s,t),i) = floor ((q*xtr) % (s*x+t)) 
where x=2*i-1 
> safe ((q,r,s,t),i) n = ( == floor ((q*xt2*r) % (s*x+2#t))) 
> where x=5*i-2 
> prod (z,i) n = (comp (10, -10*n, 0, 1) a, i) 
> cons (z,i} z’ = (comp z z’, i+1) 


Gosper’s series. An even more efficient series for 2, yiclding more than one decimal 
digit for each term, is due to Gosper [4]: 


re Ixl we 2x3 Spe ars i(2i —1) - 
of —| oo XX eae see DES x —O-—-o7 tee : 
3x4x5 3x7x8 ' 3i + NGi +2) 
(6) 
Once more, we can view this as an infinite composition of linear fractional transfor- 
mations, namely, 
ae Lx! ae 2x3 
— —_——— ——————- x 
7 3x4x5 . 3x7x8 
i(2i — 1 
(proepa® Jorn 
3(37 + DGi + 2) 
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It is not hard to show that, after consuming 7 — 1 terms of the input, the remaining 

. . ; 333 : : 7 
terms denote a value in the range [27 ~ 4, 22) — 22-], which gives rise to the follow- 
ing program: : 


> piG = stream next safe prod cons init lifts where 

> init = ((1,0,0,1), 1) 

> ilfts = [let j = 3*(3*i+1)*(3*i+2) 

> in (i*(2*i~1),j*(5*i-2),0,j) | 
> i<-(4..]] 

> next ((q,r,s,t),i) = div (q*xt+5*r) (s*x+5*t) where 
> x = 27*i+15 

> safe ((qg,r,s,t),i) n = (mn == div (q*xt+125*r) (s*xt125*t)) 
> where x=675*1-216 

> prod (z,i) n = (comp (10, -10*n, 0, 1) z, i) 
> cons (z,i) z’ = (comp z 2’, iti) 


(The Let here is another form of local definition.) 


A challenge. Gospev’s series (6) yields more than one digit per term on average, since 
the scaling factors approach % (which is less than i) from below. This suggests that 
we could dispense with the test altogether and strictly alternate between consumption 
and production, as expressed by the following conjecture. Eliminating the test would 
speed up the algorithm considerably. 


Conjecture 1. Define the following functions: 


5 
p((z, i). n) = (: & a .i) 


c((z, i), 2) =(ez,i tl) 


. 675i — 216 
s((Z,i),n) = (: = |) ( 125 ))) 


2) 
a 0 3(3i + 1)(3i +2) 


= (Gi) 


H; = p(v;, Yi) 


Fori=1,2..., let 


Uv; = C(Hj-1, X)) 
ye = (uy). 


Then s(v;, y;) holds for alli, 


If this conjecture holds, then, in the following program, every value taken by the vari- 
able v satisfies the condition safe v (next v). 


April 2006] UNBOUNDED SPIGOT ALGORITHMS FOR THE DIGITS OF PI 327 


16. UNBOUNDED SPIGOT ALGORITHMS FOR THE DIGITS OF PI (2006) 257 


> piG2 = process next prod cons init lifts 
> process next prod cons u (x:xs) 


> = y : process next prod cons (prod v y) xs 
> where v = cons u x 
> y = next v 


(Here, next, prod, cons, init, and 1fts, as weil as the predicate safe mentioned 
in the claim, are as in section 6.) 

We have not been able to prove Conjecture 1, although we have verified it for the 
first thousand terms. Perhaps some diligent reader can provide enlightenment. If it is 
valid, then piG2 does indeed produce the digits of 7, and the optimizations outlined 
at the end of section 5 can be applied to piG2, yielding the folowing program: 


> piG3 = g(1,180,60,2) where 

> g(q,r,t,i) = 

> let (u,y)=(3* (3*i+1) *(3*i+2) ,div(q*(27*i-12)+5*r) (5*t)) 
> in y : g(10*qeix(2*i-1) ,10eu* (qe (5*i-2) +r-y*t) ,t*u,it1) 


This is of comparable Iength to the compressed program given in section 1, but ap- 


proximately five times faster. 
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“Normality: A stubborn question,” from Mathematics by Experiment: Plausible 
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produced with permission of AK Peters. 


Synopsis: 

We present here excerpts from two chapters of the second edition of our book 
Mathematics by Experiment. The first chapter presents an overview of computations 
of 7 through the ages, followed by a summary of computer-age developments, includ- 
ing a detailed analysis of the BBP formula and algorithm for computing binary digits 
of 7 beginning at an arbitrary starting position. 

The second selection from our book takes a look at a question that has puzzled 
mathematicians from time immemorial (and which has spurred many computations 
of 7): whether and why the digits of 7 and other well-known mathematical con- 
stants are “normal,” meaning that every m-long string of base-b digits appears, 
in the limit, with frequency 1/b™. Included here are some details of some recent 
results in this area, such as a proof of normality of the Stoneham numbers, namely 
constants of the form 


see oe 
Sa SB chock’ 
k=0 


where b > 2 and c> 2 are relatively prime. 
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Nowadays, this is almost trivial: A “Minpoly” calculation immediately 
returns 29—802—2427+162* = 0 and this has the surd above as its smallest 
positive root. At this point, the authors could use known results only to 
prove the value of a(1),a(2) and a(3). Those for a(5) and a(7) remained 
conjectural. There was, however, an empirical family of algorithms for 7: 
let ag = a(N) and ko = ky (where k’ = V1 — k?) and iterate 


1—k! 
knw = n ; 
' 1+ ki, oe) 
Qa = C4 eg Pag = VN 2 ek (3.23) 
Then 
lima; = . (3.24) 
noo 


Again, (3.24) was provable for N = 1,2,3 and only conjectured for 
N = 5,7. In each case the algorithm appeared to converge quadratically 
to a. On closer inspection while the provable cases were correct to 5,000 
digits, the empirical ones agreed with 7 to roughly 100 places only. Now, 
in many ways to have discovered a “natural” number that agreed with 
am to that level—and no more—would have been more interesting than 
the alternative. That seemed unlikely, but recoding and rerunning the 
iterations kept producing identical results. 

Twenty years ago, very high-precision calculation was less accessible, 
and the code was being run in a Berkeley Unix integer package. After 
about six weeks of effort, it was found that the square root algorithm in 
the package was badly flawed, but only if run with an odd precision of more 
than 60 digits! And for idiosyncratic reasons that had only been the case 
in the two unproven cases. Needless to say, tracing the bug was a salutary 
and somewhat chastening experience. 


3.4 Computing Individual Digits of Pi 


An outsider might be forgiven for thinking that essentially everything of 
interest with regards to 7 has been discovered. For example, this sentiment 
is suggested in the closing chapters of Beckmann’s 1971 book on the history 
of a [48, pg. 172]. Ironically, the Salamin—Brent quadratically convergent 
iteration was discovered only five years later, and the higher-order conver- 
gent algorithms followed in the 1980s. In 1990, Rabinowitz and Wagon 
discovered a “spigot” algorithm for 7, which permits successive digits of 
m (in any desired base) to be computed with a relatively simple recursive 
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algorithm based on the previously generated digits (see [239] and Item 15 
at the end of this chapter). 

But even insiders are sometimes surprised by a new discovery. Prior 
to 1996, almost all mathematicians believed that if you want to determine 
the d-th digit of 7, you have to generate the entire sequence of the first d 
digits. (For all of their sophistication and efficiency, the schemes described 
above all have this property.) But it turns out that this is not true, at least 
for hexadecimal (base 16) or binary (base 2) digits of 7. In 1996, Peter 
Borwein, Simon Plouffe, and one of the present authors (Bailey) found an 
algorithm for computing individual hexadecimal or binary digits of a [33]. 
To be precise, this algorithm: 


(1) directly produces a modest-length string of digits in the hexadecimal 
or binary expansion of 7, beginning at an arbitrary position, without 
needing to compute any of the previous digits; 


2) can be implemented easily on any modern computer; 


3 


(2) 

(3) does not require multiple precision arithmetic software; 
(4) requires very little memory; and 
(5) 


5) has a computational cost that grows only slightly faster than the digit 


position. 


Using this algorithm, for example, the one millionth hexadecimal digit 
(or the four millionth binary digit) of 7 can be computed in less than a 
minute on a 2003-era computer. The new algorithm is not fundamentally 
faster than best-known schemes for computing all digits of 7 up to some 
position, but its elegance and simplicity are nonetheless of considerable 
interest. This scheme is based on the following remarkable new formula 
for 7: 


Theorem 3.1. 


1/4 2 1 1 
= ee, 
. ir Gs 8i+4 8145 ass) Cay 


{= 


Proof. First note that for any k < 8, 


W/V2 k-1 isa gee 


1 loo} 
= _ 3.26 
Qk/2 —— a +k) ( ) 


17. MATHEMATICS BY EXPERIMENT (2008) 


120 3. Pi and Its Friends 


Thus one can write 


co 


so ee ee 
7 16? 8i+1 81 +4 8i+5 8i+6 


1/V2 4,/9 — 823 — 4,/2x4 — 8a° 
Z 7 ee de. (3.27) 
0 1-@ 
which on substituting y = 2x becomes 
1 1 1 
16y —16 4 Ay — 8 
lS = | iw | ts 
ge 2g +4y—4 oy 2 ie eee dae 
= 7. (3.28) 


However, in presenting this formal derivation, we are disguising the 
actual route taken to the discovery of this formula. This route is a superb 
example of experimental mathematics in action. 

It all began in 1995, when Peter Borwein and Simon Plouffe of Simon 
Fraser University observed that the following well-known formula for log 2 
permits one to calculate isolated digits in the binary expansion of log 2: 


= 
log2 = Jae (3.29) 
k=0 


This scheme is as follows. Suppose we wish to compute a few binary digits 
beginning at position d+ 1 for some integer d > 0. This is equivalent to 
calculating {27 log 2}, where {-} denotes fractional part. Thus we can write 


Pa he 


d 9d-k mo Co o5d—k 
_ {oe : uN sy — \ (3.30) 


k=0 k=d+1 


{24 log 2} 


We are justified in inserting “mod k” in the numerator of the first summa- 
tion, because we are only interested in the fractional part of the quotient 
when divided by k. 

Now the key observation is this: The numerator of the first sum in 
Equation (3.30), namely 27~* mod k, can be calculated very rapidly by 
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means of the binary algorithm for exponentiation, performed modulo k. 
The binary algorithm for exponentiation is merely the formal name for the 
observation that exponentiation can be economically performed by means 
of a factorization based on the binary expansion of the exponent. For 
example, we can write 3!7 = ((((37)?)?)?) - 3, thus producing the result 
in only 5 multiplications, instead of the usual 16. According to Knuth, 
this technique dates back at least to 200 BCE [188, pg. 461]. In our 
application, we need to obtain the exponentiation result modulo a positive 
integer k. This can be done very efficiently by reducing modulo k the 
intermediate multiplication result at each step of the binary algorithm for 
exponentiation. A formal statement of this scheme is as follows: 


Algorithm 3.2. Binary algorithm for exponentiation modulo k. 


To compute r = b” mod k, where r,b,n and k are positive integers: First 
set t to be the largest power of two such that t <n, and set r = 1. Then 
A:ifn>tthenr<«+brmodk; ne n-t; endif 
te t/2 
ift>1thenr +r? modk; goto A; endif 


Note that the above algorithm is performed entirely with positive inte- 
gers that do not exceed k? in size. Thus ordinary 64-bit floating-point or 
integer arithmetic, available on almost all modern computers, suffices for 
even rather large calculations. 128-bit floating-point arithmetic (double- 
double or quad precision), available at least in software on many systems 
(see Section 6.2.1), suffices for the largest computations currently feasible. 

We can now present the algorithm for computing individual binary dig- 
its of log 2. 


Algorithm 3.3. Individual digit algorithm for log 2. 


To compute the (d + 1)-th binary digit of log2: Given an integer d > 0, 
(1) calculate each numerator of the first sum in Equation (3.30), using 
Algorithm 3.2, implemented using ordinary 64-bit or 128-bit floating-point 
arithmetic; (2) divide each numerator by the respective value of k, again 
using ordinary floating-point arithmetic; (3) sum the terms of the first 
summation, while discarding any integer parts; (4) evaluate the second 
summation as written using floating-point arithmetic—only a few terms are 
necessary since it rapidly converges; and (5) add the result of the first and 
second summations, discarding any integer part. The resulting fraction, 
when expressed in binary, gives the first few digits of the binary expansion 
of log 2 beginning at position d+ 1. 
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As soon as Borwein and Plouffe found this algorithm, they began seek- 
ing other mathematical constants that shared this property. It was clear 
that any constant a of the form 


a = — Pk) (3.31) 


k=0 


where p(k) and q(k) are integer polynomials, with degp < degq and q 
having no zeroes at nonnegative integer arguments, is in this class. Further, 
any rational linear combination of such constants also shares this property. 
Checks of various mathematical references eventually uncovered about 25 
constants that possessed series expansions of the form given by Equation 
(3.31). 

As you might suppose, the question of whether z also shares this prop- 
erty did not escape these researchers. Unfortunately, exhaustive searches 
of the mathematical literature did not uncover any formula for 7 of the 
requisite form. But given the fact that any rational linear combination of 
constants with this property also shares this property, Borwein and Plouffe 
performed integer relation searches to see if a formula of this type existed for 
am. This was done, using computer programs written by one of the present 
authors (Bailey), which implement the “PSLQ” integer relation algorithm 
in high-precision, floating-point arithmetic [16,140]. We will discuss the 
PSLQ algorithm and related techniques more in Section 6.3. 

In particular, these three researchers sought an integer relation for the 
real vector (@1,@2,:*: ,Qn), where ay = 7m and (aj, 2 < i < n) is the 
collection of constants of the requisite form gleaned from the literature, 
each computed to several hundred decimal digit precision. To be precise, 
they sought an n-long vector of integers (a;) such that 57; a;a; = 0, to 
within a very small “epsilon.” After a month or two of computation, with 
numerous restarts using new a vectors (when additional formulas were 
found in the literature) the identity (3.25) was finally uncovered. The 
actual formula found by the computation was: 


nm = AF(1/4,5/4;1;-1/4)+ 2arctan(1/2) —log5, (3.32) 


where F'(1/4,5/4; 1; —1/4) = 0.955933837 ... is a hypergeometric function 
evaluation. Reducing this expression to summation form yields the new 7 
formula: 


1 
ee, «| 
a vielen 844 845 ss) oa) 


To return briefly to the derivation of Formula (3.33), let us point out 
that it was discovered not by formal reasoning, or even by computer-based 
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symbolic processing, but instead by numerical computations using a high- 
precision implementation of the PSLQ integer relation algorithm. It is 
most likely the first instance in history of the discovery of a new formula 
for 7 by a computer. We might mention that, in retrospect, Formula (3.33) 
could be found much more quickly, by seeking integer relations in the vector 
(7, 5$1,S2,-++ ,Sg), where 


= 1 


Such a calculation could be done in a few seconds on a computer, even if 
one did not know in advance to use 16 in the denominator and 9 terms in 
the search, but instead had to stumble on these parameters by trial and 
error. But this observation is, as they say, 20-20 hindsight. The process of 
real mathematical discovery is often far more tortuous and less elegant than 
the polished version typically presented in textbooks and research journals. 

It should be clear at this point that the scheme for computing individual 
hexadecimal digits of 7 is very similar to Algorithm 3.3. For completeness, 
we state it as follows: 


Algorithm 3.4. Individual digit algorithm for 7. 


To compute the (d+ 1)-th hexadecimal digit of 7: Given an integer d > 0, 
we can write 


{16x} = {4{1675,} — 2{16%S,} — {16455} — {164S6}}, (3.35) 


using the S; notation of Equation (3.34). Now apply Algorithm 3.3, with 


d love) 
16¢-* 16¢-* 
ist = »: p+ 5 
a {{ ri k=d a 


k=0 4 
7 Rp Se 3 oat (3.36) 
Ls BK + 5 a BRS 


instead of Equation (3.30), to compute {1679} for 7 = 1,4,5,6. Combine 
these four results, discarding integer parts, as shown in (3.35). The result- 
ing fraction, when expressed in hexadecimal notation, gives the hex digit 
of 7 in position d+ 1, plus a few more correct digits. 


As with Algorithm 3.3, multiple-precision arithmetic software is not 
required—ordinary 64-bit or 128-bit floating-point arithmetic suffices even 
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for some rather large computations. We have omitted here some numer- 
ical details for large computations—see [33]. Sample implementations in 
both C and Fortran-90 are available from http://www.experimentalmath 
info. 

One mystery that remains unanswered is why Formula (3.33) was not 
discovered long ago. As you can see from the above proof, there is nothing 
very sophisticated about its derivation. There is no fundamental reason 
why Euler, for example, or Gauss or Ramanujan, could not have discov- 
ered it. Perhaps the answer is that its discovery was a case of “reverse 
mathematical engineering.” Lacking a motivation to find such a formula, 
mathematicians of previous eras had no reason to derive one. But this still 
doesn’t answer the question of why the algorithm for computing individual 
digits of log2 had not been discovered before—it is based on a formula, 
namely Equation (3.29), that has been known for centuries. 

Needless to say, Algorithm 3.4 has been implemented by numerous re- 
searchers. In 1997, Fabrice Bellard of INRIA computed 152 binary digits of 
m starting at the trillionth binary digit position. The computation took 12 
days on 20 workstations working in parallel over the Internet. His scheme 
is actually based on the following variant of 3.33: 


Ltr 7 32 8 1 

oa 1024" (= 1+ ae+27 rl a ai) 
This formula permits individual hex or binary digits of 7 to be calculated 
roughly 43% faster than (3.25). 

A year later, Colin Percival, then a 17-year-old student at Simon Fraser 
University, utilized a network of 25 machines to calculate binary digits in 
the neighborhood of position 5 trillion, and then in the neighborhood of 40 
trillion. In September 2000, he found that the quadrillionth binary digit 
is “O,” based on a computation that required 250 CPU-years of run time, 
carried out using 1,734 machines in 56 countries. Table 3.4 gives some 
results known as of this writing. 

One question that immediately arises in the wake of this discovery is 
whether or not there is a formula of this type and an associated compu- 
tational scheme to compute individual decimal digits of 7. Searches con- 
ducted by numerous researchers have been unfruitful. Now it appears that 
there is no nonbinary formula of this type—this is ruled out by a new result 
co-authored by one of the present authors (see Section 3.7) [73]. However, 
none of this removes the possibility that there exists some completely differ- 
ent approach that permits rapid computation of individual decimal digits 


267 


17. MATHEMATICS BY EXPERIMENT (2008) 


3.5. Unpacking the BBP Formula for Pi 125 


Hex Digits Beginning 
Position at This Position 
10° 26C65E52CB4593 


10° 17AF5863EFED8D 
108 ECB840E21926EC 


10° 85895585A0428B 
101° 921C73C6838FB2 
101! 9C381872D27596 
1.25 x 10!2 07E45733CC790B 
2.5 x 10!4 E6216BO69CB6C1 


Table 3.4. Computed hexadecimal digits of 7. 


of z. Also, as we will see in the next section, there do exist formulas for 
certain other constants that admit individual digit calculation schemes in 
various nonbinary bases (including base ten). 


3.5 Unpacking the BBP Formula for Pi 


It is worth asking “why” the formula 
—ael 4 2 1 1 
= — | —— - ——_ - ——— - —— 3.38 
. i (SH 8i+4 BI+5 ars) oa 
exists. As observed above, this identity is equivalent to, and can be proved 
by establishing: 
4/2 — 823 — 4/224 — 825 
0 L=2 
The present version of Maple evaluates this integral to 


—2log2 + 2log(2 — V2) + 1 + 2log(2 + V2), (3.39) 


which simplifies to 7. In any event, one can ask what the individual series 
in (3.38) comprise. So consider 


CO 


1 
ae » 16" (8k + b) 
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for 1 <b <8 and the corresponding normalized integrals 


I(b) = 2/2 | ve ee dx (3.40) 
> é 1-2 ‘ 


Again, Maple provides closed forms for I(b) in which the basic quanti- 
ties seem to be the following: arctan(2),arctan(1/2), /2 arctan(1/V2), 
log(2), log(3), log(5), and log(V2+ 1). At this point one may use integer 
relation methods and obtain: 


=e log5 V2log(V2—1)  arctan(1/2) a V2 arctan(/2/2) 


8 8 4 4 4 
Sb = log(3) in arctan(1/2) 
4 2 
T V2 log(V/2 —1)  arctan(1/2) V2 arctan(V2/2) log 5 
S3 — ar 
4 2 2 2 4 
_ logd — log3 
=e 2 


log 5 


S5 = -> — V2log(V2 — 1) + arctan(1/2) + V2 arctan(V2/2) — 

Se6 = log3 — 2arctan(1/2) 

S7 = —m + log5 — 2V2log(V2 — 1) + 2arctan(1/2) — 2V2 arctan(v2/2) 

Sg = 8log2 — 2log5 — 2log3. (3.41) 
Thus the “simple” hexadecimal formula (3.38) is actually a molecule made 
up of more subtle hexadecimal atoms: with the final bond coming from 
the simple identity arctan 2 + arctan(1/2) = 7/2. As an immediate conse- 


quence, one obtains the formula arctan(1/2) = S2 — S¢/4. 
Furthermore, the facts that 
1 
a 


1-1 
Im (ios (1 = )) = arctan ( 
x il 
2 arctan(1/3) + arctan(1/7) arctan(1/2) + arctan(1/3) 
arctanl = 7/4 (3.42) 


allow one to write directly a base-64 series for arctan(1/3) (using x = 4) and 
a base-1024 series for arctan(1/7) (using x = 8). This yields the identity 


7 1 (-1)” 8 4 1 
a> eas 64" Gaitmetas) 


1 S(-1)" ( 32 8 1 
— —_ + ——__  —___ A 
= 56 2 Toad (otmote): ee) 
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which is similar to, although distinct from, the identity used by Bellard 
and Percival in their computations. 


3.6 Other BBP-Type Formulas 


A formula of the type mentioned in the previous sections, namely 


a = 3 PAN). (3.44) 


is now referred to as a BBP-type formula, named after the initials of the 
authors of the 1997 paper where the a hex digit algorithm appeared [33]. 
For a constant a given by a formula of this type, it is clear that individual 
base-b digits can be calculated, using the scheme similar to the ones outlined 
in the previous section. The paper [33] includes formulas of this type for 
several other constants. Since then, a large number of other BBP-type 
formulas have been discovered. 

Most of these identities were discovered using an experimental ap- 
proach, using PSLQ searches. Others were found as the result of educated 
guesses based on experimentally obtained results. In each case, these for- 
mulas have been formally established, although the proofs are not always 
as simple as the proof of Theorem 3.1. We present these results, in part, 
to underscore the fact that the approach used to find the new formula for 
m has very broad applicability. 

A sampling of the known binary BBP-type formulas (i.e., formulas with 
a base b = 2? for some integer p) is shown in Table 3.5. Some nonbinary 
BBP-type formulas are shown in Table 3.6. These formulas are derived 
from several sources: [33, 93,94]. An updated collection is available at [19]. 
The constant G that appears in Table 3.5 is Catalan’s constant, namely 
G=1-1/3?+1/5?-1/7? +--- = 0.9159655941... 

In addition to the formulas in Tables 3.5 and 3.6, there are two other 
classes of constants known to possess binary BBP-type formulas. The 
first is logarithms of certain integers. Clearly, logn can be written with 
a binary BBP formula (i.e. a formula with 6 = 2” for some integer m) 
provided n factors completely using primes whose logarithms have binary 
BBP formulas—one merely combines the individual series for the different 
primes into a single binary BBP formula. We have seen that the logarithm 
of the prime 2 possesses a binary BBP formula, and so does log 3, by the 
following reasoning: 


128 


V3 


log” 2 


T log 2 


log 3 
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II 
i) 
i 
(o) 
ge 
bo 
+ 
o 
ge 
aia 
me 
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| 
Nae” 
| 
bo 
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| + 
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> 
ii Me 
NA 
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Hs] 
= 


k=1 
tex 7 2 iT i... 2 
~ Bremner mee 3e 3) 
(a7) 
= a (3.46) 
k 
f~ 4k \ 2K +1 


oe HG gO 
24~ 64" \6k+1 °° Ch+2 6k+4  6k+5 


3 
oe oe 
824 64 \(6R+1)2 (6h+2)2 (6h+3)2 (6k+4)? 


yee 64. — G0 HCO 


, 4 1 
(6k +52 (6k+6) 


oe ‘> 1 4096 819226112 15360 
256 — 4096" (24k +1)? (24k +2)? (24K +3)? * (24k +4)? 
1024 9984 11520 2368512 

(24k +5)? © (24k +6)? *° (24k +8)? ' (24k +9)? (24k + 10)? 
768 64 4 408 720 

(24k +12)? (24k +13)? " (24k +15)? ° (24k + 16)? 

, 16  , 6 

(24k +17)? ° (24k +18)? " (24k +20)? (24k + 21)? 

1 3 1 3072, 307223040 12288 
1024 + 4096" \ (24k +1)? (24K +2)? (24K +3)? © (24k +4)? 
___ 768, _ 9216, 10368, 2496 192 

(24k +5)? © (24k +6)? * (24k +8)? © (24k +9)? (24k + 10)? 
768 48 360 648 

(24k +12)? (24k +13)? " (24k +15)? ° (24k + 16)? 

12 168 48 39 
"(24k + 17)? v (24k + 18)2 " (24k +20)? (24k + 21)? 


Table 3.5. Binary BBP-type formulas. 
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1m 1 81 54 9 12 
i ae oe ee 


3 7 2 7 1 
12k+7 12k+8 12k+10 


ee ae 729 81, _8l 9 
"729 4+ 729" \6k+1 OK+Q GK+B OKT 
9 1 
"655° 6h 3) 
ia ay i 243 405s 
27 < 729" \(12k+1)? (12k +2)? (12k + 4)? 
27 72 9 9 


(12k +5)? (12k+6)2 (12k+7)2 (12k+8)? 
2 et 
(12k +10)? ' (12k +11)? 


ve (5 “) == 7D 


; 1111111111 1 108 =A 107 10° 
387420489 =. 2 gee 10k +1. 10k+2' 10k4+3 
10° ie 10° 10? 
+ ———_ + 


*ioR+4' 10k+5 10k+6 Ohh? 


Fe eee 
10k+8 10k+9 


25 781 (57— aus\ = 5 1 
— log = as + 
2 256 \57+45/5 4 5°* \Sk+2  5k+3 


Table 3.6. sia BBP-type formulas. 
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(3.45) 


In a similar manner, it can be shown, by examining the factorization of 
2” +1 and 2” — 1, where n is an integer, that numerous other primes have 
this property. Some additional primes can be obtained by noting that the 


real part of the Taylor series expansion of 


yields a BBP-type formula. See [19] for details. 


(3.47) 
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The logarithms of the following primes are now known to possess binary 
BBP formulas [103]: 


2, 3, 5, 7, 11, 13, 17, 19, 29, 31, 37, 41, 43, 61, 73, 109, 113, 127, 151, 
241, 257, 331, 337, 397, 683, 1321, 1429, 1613, 2113, 2731, 5419, 8191, 
14449, 26317, 38737, 43691, 61681, 65537, 87211, 131071, 174763, 
246241, 262657, 268501, 279073, 312709, 524287, 525313, 599479, 
2796203, 4327489, 7416361, 15790321, 18837001, 22366891 (3.48) 


This list is certainly not complete, and it is unknown whether or not all 
primes have this property, or even whether the list of such primes is finite 
or infinite. One can also obtain BBP-type formulas in nonbinary bases for 
the logarithms of certain integers and rational numbers. One example is 
given by the base ten formula for log(9/10) in Table 3.6. This has been 
used to compute the ten billionth decimal digit of log(9/10) [33]. 

One additional class of binary BBP-type formulas that we will mention 
here is arctangents of certain rational numbers. We present here the results 
of experimental searches, using the PSLQ integer relation algorithm, which 
we have subsequently established formally. The formal derivation of these 
results proceeds as follows. Consider the set of rationals given by q = 
\Im(T)/Re(T)| or |Re(T)/Im(T)|, where 


Il (1 + x) - (1 é a (3.49) 


for various m-long nonnegative integer vectors t, u, v, w and choices 
of signs as shown [74, pg. 344]. For example, setting ¢ = (1,1), u = 
(1,1), v = (1,3), w = (1,1), with signs (1,—1,-—1,1), gives the result 
T = 25/32 — 5i/8, which yields g = 4/5. Indeed, one can obtain the 
formula 


(5) 1a (Ges 393216 491520 163840 
arctan 5 


~ oT 2 oF GORF2~ WOkFA Z0k+5 | 40k+8 
. 768 24576 5120 10240 
A0k+ 10 40k+12 40k+15' 40k+16 
2088, 102d 640480 
W0k+18 | 40k+20 ' 40k+24' 40k405 
, 28 968 
WOk+ 26 40k+28 40k+32° 40k434 
5 6 
a eo 3.50 
W0k+35  40k+ =) ea) 
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The set of rationals for which BBP formulas can be obtained in this 
way can be further expanded by applying the formula 


tanr + tans 


tan(r + s) (3.51) 


1—tanrtans’ 
for rationals r and s for which binary BBP-type formulas are found. By 
applying these methods, it can be shown that binary BBP formulas exist 
for the arctangents of the following rational numbers. Only those rationals 
with numerators < denominators < 25 are listed here. 


1/2, 1/3, 2/3, 1/4, 3/4, 1/5, 2/5, 3/5, 4/5, 1/6, 5/6, 1/7, 3/7, 

4/7, 5/7, 6/7, 1/8, 7/8, 1/9, 2/9, 7/9, 8/9, 3/10, 1/11, 2/11, 

3/11, 7/11, 8/11, 10/11, 1/12, 5/12, 1/13, 4/13, 6/13, 7/13, 9/13, 
11/13, 12/13, 3/14, 5/14, 1/15, 4/15, 8/15, 1/16, 7/16, 11/16, 13/16, 
15/16, 1/17, 4/17, 6/17, 7/17, 9/17, 11/17, 15/17, 16/17, 1/18, 13/18, 
3/19, 4/19, 6/19, 7/19, 8/19, 9/19, 11/19, 17/19, 9/20, 1/21, 13/21, 
16/21, 20/21, 3/22, 7/22, 9/22, 19/22, 21/22, 2/23, 4/23, 6/23, 7/23, 
9/23, 10/23, 11/23, 14/23, 15/23, 7/24, 11/24, 23/24, 1/25, 2/25, 
13/25, 19/25, 21/25 (3.52) 


Note that not all “small” rationals appear in this list. As it turns out, by 
applying the methods given in the paper [73] (see the next section), one 
can rule out the possibility of Machin-type BBP formulas (as described 
in Section 3.6) for the arctangents of 2/7, 3/8, 5/8, 4/9, and 5/9. Thus 
we believe the above list to be complete for rationals with numerators 
and denominators up to ten. Beyond this level, we do not know for sure 
whether this list is complete, or whether applying formula (3.49), together 
with addition and subtraction formulas, generates all possible rationals 
possessing binary BBP-type formulas. 

One can obtain BBP formulas in nonbinary bases for the arctangents of 
certain rational numbers by employing an appropriate variant of formula 
(3.49). 


3.7. Does Pi Have a Nonbinary BBP Formula? 


As we mentioned above, from the day that the BBP-formula for 7 was 
discovered, many researchers have wondered whether there exist BBP-type 
formulas that would permit computation of individual digits in bases other 


17. MATHEMATICS BY EXPERIMENT (2008) 


132 3. Pi and Its Friends 


than powers of two (such as base ten). This is not such a far-fetched 
possibility, because both base-2 and base-3 formulas are known for 77, as 
well as for log 2 (see Tables 3.5 and 3.6). But extensive computations failed 
to find any nonbinary formulas for 7. 

Recently one of the present authors, together with David Borwein (Jon’s 
father) and William Galway, established that there are no nonbinary Machin- 
type arctangent formulas for 7. We believe that if there is no nonbinary 
Machin-type arctangent formula for 7, then there is no nonbinary BBP- 
type formula of any form for 7. We will summarize this result here. Full 
details and other related results can be found in [73]. 

We say that the integer b > 1 is not a proper power if it cannot be 
written as c” for any integers c and m > 1. We will use the notation 
ord,(z) to denote the p-adic order of the rational z € Q. In particular, 
ord,(p) = 1 for prime p, while ord,(q) = 0 for primes g # p, and ord, (wz) = 
ord,(w) + ord,(z). The notation 1(p) will mean the order of the integer 
b in the multiplicative group of the integers modulo p. We will say that 
p is a primitive prime factor of b™ — 1 if m is the least integer such that 
p|(b™ —1). Thus p is a primitive prime factor of b™ —1 provided 1)(p) = m. 
Given the Gaussian integer z € Q[?] and the rational prime p = 1 (mod 4), 
let 0,(z) denote ord,(z) — ordg(z), where p and f are the two conjugate 
Gaussian primes dividing p, and where we require 0 < S(p) < R(p) to 
make the definition of 8, unambiguous. Note that 


6,(wz) = Op(w) +6,(z). (3.53) 


Given k& € R, with 2 < b © Z and b not a proper power, we say that 
& has a Z-linear or Q-linear Machin-type BBP arctangent formula to the 
base b if and only if & can be written as a Z-linear or Q-linear combination 
(respectively) of generators of the form 


1 i = (—1)" 
aa) a —) = pe y a 
arctan (=) Slog (1 + =) b a PEO F1) (3.54) 


We will also use the following theorem, first proved by Bang in 1886: 


Theorem 3.5. The only cases where b™ — 1 has no primitive prime fac- 
tor(s) are when b = 2, m =6, b™—1 = 37-7; and whenb =2N -1, NE Z, 
m= 2. = l= oN) = 1), 


We can now state the main result of this section: 


Theorem 3.6. Given b > 2 and not a proper power, then there is no Q- 
linear Machin-type BBP arctangent formula for 7. 
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Proof. It follows immediately from the definition of a Q-linear Machin- 
type BBP arctangent formula that any such formula has the form 


1 M 
= = S)) rmFlog(b™ - i) (3.55) 
nm 
m=1 


wheren >0€ Z, nm € Z, and M > 1, ny #0. This implies that 


M 
[[@"-a"" €e"""™Q% = Q” (3.56) 


m=1 


For any 6 > 2 and not a proper power we have M, < 2, so it follows 
from Bang’s Theorem that b* — 1 has a primitive prime factor, say p. 
Furthermore, p must be odd, since p = 2 can only be a primitive prime 
factor of b™—1 when b is odd and m = 1. Since pis a primitive prime factor, 
it does not divide b?” —1, and so p must divide 6?” +1 = (b@ +i)(b™ —i). 
We cannot have both plb™ + % and p|b” — i, since this would give the 
contradiction that p|(b! + 7) — (bM@ — i) = 2i. It follows that p = 1 
(mod 4), and that p factors as p = pp over Z{i], with exactly one of p, P 
dividing b“ — i. Referring to the definition of 6, we see that we must have 
6,(b“ —i) £0. Furthermore, for any m < M, neither p nor p can divide 
b™ — i since this would imply p | 4” — 1, 4m < 4M, contradicting the 
fact that p is a primitive prime factor of b4” — 1. So form < M, we have 
6,(b” — 7) = 0. Referring to equation (3.55), using Equation (3.53) and 
the fact that ny, #0, we get the contradiction 


0 FF nyjOp(b = Yat (o" =7) = 6,(Q*) =0. (3.57) 


Thus our assumption that there was a b-ary Machin-type BBP arctangent 
formula for 7 must be false. 


3.8 Commentary and Additional Examples 


1. The ENIAC Integrator and Calculator. ENIAC, built in 1946 
at the University of Pennsylvania, had 18,000 vacuum tubes, 6,000 
switches, 10,000 capacitors, 70,000 resistors, 1,500 relays, was 10 feet 
tall, occupied 1,800 square feet, and weighed 30 tons. ENIAC could 
perform 5,000 arithmetic operations per second—1,000 times faster 
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Figure 3.3. The ENIAC computer. Courtesy of Smithsonian Institution. 


than any earlier machine, but a far cry from today’s leading-edge mi- 
croprocessors, which can perform more than four billion operations 
per second. The first stored-memory computer, ENIAC could store 
200 digits, which again is a far cry from the hundreds of megabytes in 
a modern personal computer system. Data flowed from one accumu- 
lator to the next, and after each accumulator finished a calculation, it 
communicated its results to the next in line. The accumulators were 
connected to each other manually. A photo is shown in Figure 3.3. 


2. Four approximations to pi. Here are two well known, but fasci- 
nating, approximations to 7: 


TX 


3 
= log (640320) , 
Visa g ( ) 


correct to 15 decimal places, and 


3 


TS 


log (5280) , 


S 
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correct to 9 decimal places. Both rely on somewhat deeper number 
theory (see Section 1.4 in the second volume). Here are two nice 
algebraic 7 approximations: 


V2 


mt = 66 ————_ 
33.,/29 — 148 


and 
_ 638 17+ 15 V5 


Tx =a — ——. 
25 7415/5 


3. An arctan series for pi. Find rational coefficients a; such that the 
identity 


7T = ayzarctan + ag arctan 


1 1 
390112 485298 


+az3 arctan + a4arctan 


1 1 
683982 1984933 


1 
t _—— 
TG ARGUED Tage 


1 
tt; ——— 
Fas arctan 9773398 


+ Ag arctan 


: 1 
+a7 arctan [57550] 7 22709274 


+ag arctan + G19 arctan 


1 
24208144 | 201229582 


Fans arctan 5739376182 

holds [10, pg. 75]. Also show that an identity with even simpler 
coefficients exists if arctan 1/239 is included as one of the terms on 
the RHS. Hint: Use an integer relation program (see Section 6.3), or 
try the tools at one of these sites: http://oldweb.cecm.sfu.ca/projects/ 
IntegerRelations or http://www.experimentalmath.info. 


4. Ballantine’s series for pi. A formula of Euler for arccot is 


. (n!)? A" _ a 
pe Qnd)! (@ a1 = arctan (=) . (3.58) 


As observed by Ballantine in 1939, ( [49]) this allows one to rewrite 
the variant of Machin’s formula, used by Guilloud and Bouyer in 1973 
to compute a million digits of 7, 


1 1 1 
“ = 12arctan ia + 8 arctan a7 5 arctan 330 (3.59) 
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in the neat form 


(n!)? 4" 


— 804 So ee te Se 


n=0 


1 
— 20 arctan — 3.60 
arctan 535) (3.60) 


where the terms of the second series are just decimal shifts of the 
terms of the first. 


. Convergence rates for pi formulas. Analyze the rates of con- 


vergence of Archimedes iteration (3.1), the Salamin-Brent iteration 
(3.16), the Borwein cubic iteration (3.17) and the Borwein quartic 
iteration (3.18), by means of explicit computations. Use the high- 
precision arithmetic facility built into Maple or Mathematica, or write 
your own C++ or Fortran-90 code using the ARPREC arbitrary 
precision software available at http://www.experimentalmath.info, or 
the GNU multiprecision software available at http://www.gnu.org 
/software/gmp/gmp.html. Such iterations are discussed more in Sec- 
tions 5.6.2 and 5.6.3. 


Biblical pi. As noted in Section 3.1, the Biblical passages 1 Kings 
7:23 and 2 Chronicles 4:2 indicate that 7 = 3. In spite of the fact the 
context of these verses clearly suggests an informal approximation, 
not a precise statement of mathematical fact, this discrepancy has 
been a source of consternation among Biblical literalists for centuries. 
For example, an 18th-century German Bible commentary attempted 
to explain away this discrepancy using the imaginative (if pathetic) 
suggestion that the circular pool in Solomon’s temple (clearly de- 
scribed in 2 Chron. 4:2 as “round in compass”) was instead hexago- 
nal in shape [48, pg. 75-76]. Even today, some are still unwilling to 
accept that the Bible could simply be mistaken here. One evangelical 
scholar, for example, writes: 


However, the recorded dimensions are still no problem if we 
consider the shape of the vessel. In 1 Kings 7:26, we read that 
its “brim was made like the brim of a cup, as a lily blossom.” 
Hence, the sea was not a regular cylinder, but had an outward 
curving rim. Although we do not know the exact points on 
the vessel where the measurements were taken, the main part 
of the sea always will be somewhat smaller than the 10 cubits 
measured “from brim to brim.” [213] 
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Another geometric difficulty in these Biblical passages is that 1 Kings 
7:26 (three verses after it gives its dimensions) gives a volume of 2,000 
“baths” for the basin, while 2 Chron. 4:5 gives the figure 3,000 baths. 
2 Chron. 4:2 gives the height of the basin as five cubits. Using the 
accepted conversions that one “cubit” is roughly 46 cm, and that 
one “bath” is roughly 23 liters, then assuming Solomon’s pool was 
cylindrical in shape, we obtain an actual volume of roughly 1660 
baths. If the basin was rounded on the bottom, then its volume was 
even lower than this. 


7. Exponentiation of pi. Arguably the most accessible transcendental 
number to compute is e”, which can be computed using the following 
iteration. 


Algorithm 3.7. Computation of exp(7). 


Set ko = 1/V2 and for n < N = [log,(D/1.36)] iterate 


1k 
14+ ki" 


/ 
n 


Ko=V1—-k, kn = 


hive =1/2"— 
7 : 


Some care needs to be taken with guard digits. 


Then return 


8. Algorithms for Gamma values. An algorithm for 7 may be 


viewed as an algorithm for (4), and there is a quite analogous 


iteration for I’ at the values 1/3, 2/3, 1/4, 3/4, 1/6, and 5/6. This, 
in turn, allows rapid computation of IT (+) , for all integer k. We 
illustrate with: 


Algorithm 3.8. Computation of I (4). 
Let #9 = 24/7, y, = 21/4. Let 


Tn4+1 = 


as shin (3.61) 


YnyVtn + Ven (3.62) 
Yn +1 


Yati = 
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Then 


oo 3 
i (=) =16(1+ +2)? |[ 2," (=) ; 
4 ar 1+Yn 


This yields a quadratically convergent iteration for T (4). 


No such iteration is known for [ (4); see [68, 74]. 


. An integral representation of Euler’s constant. While it is 


known that [ (4) and [ (4) are transcendental, the status of Euler s 
constant 


wl 
7= jim 7: log(n) (3.63) 
k=1 
is unsettled. 
Problem: Show that 


Computation of Euler’s constant. Perhaps the most efficient 
method of computation, due to Brent and MacMillan ( [74, pg. 336]), 
is based on Bessel function identities. It allows one to show that if 
y is rational it must have a denominator with millions of digits. The 
underlying identity, known to Euler, is 


So(z) — Ko(z) 
To(z) 
where I,(z) = VP. 9(2/2)7**”/(kIT(k + v + 1)), while Ko(z) = 

AIq(z)/Ov and So(z) = WRo(N jar 1/4) (2/2)7*/ (Rh)? 


An algorithm follows from knowing the first terms of the asymptotic 
expansion for Ko and Io. It is 


y + log(z/2) = (3.64) 


Algorithm 3.9. Computation of Ko and Io: 


Ao = — log(n), Bo = 1, Uo = Ao, Vo = 1, and for k = 1,2,---, 


By. = Bp_in?/k?, A, = (Ap_in?/k a Bx) /k, 
U, = Up-it+Ak, Ve = Ve-1 + Br. (3.65) 
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1. 


12. 


13. 


14. 


Terminate when U;, and V; no longer change, and return y ~ U;,/V,. 
With log(n) computed efficiently, this scheme takes O(D) storage and 
approximately 2.07 D steps to compute y to D decimal places. 


Buffon’s needle. Suppose we have a lined sheet of paper, and a 
needle that is precisely as long as the distance between the lines. 
Compute the probability that the needle “thrown at random” on the 
sheet of paper will lie on a line. Answer: 2/7. Although this is 
certainly not a good way to calculate 7 (millions of trials would be 
required to obtain just a few digits), it is an instructive example of 
how 7 arises in unlikely settings (see also the next two exercises). 
Some additional discussion of this problem, plus a computer-based 
tool that allows one to perform these trials, is available at the URL 
http://www.mste.uiuc.edu/reese/buffon/buffon.html. 


Putnam problem 1993-B3. If two real numbers x and y are gen- 
erated uniformly at random in (0,1), what is the probability that the 
nearest integer to x/y is even? Hint: Ignoring negligible events, for 
this to occur either 0 < a/y < 1/2 or (4n—1)/2 < a/y < (4n+1)/2. 
The first occurs in a triangle of area 1/4 and the subsequent in trian- 
gles of area 1/(4n — 1) — 1/(4n +1). Now apply the Gregory-Leibniz 
formula. Answer: The probability is (5 — 1)/4 = 0.4646018. 


Number-theory probabilities. Prove (a) The probability that an 
integer is square-free is 6/m?. (b) The probability that two integers 
are relatively prime is also 6/77. This is a good example of 7 appear- 
ing in a number-theory setting. See [163]. 


The irrationality of pi. We reproduce in extenso Ivan Niven’s 1947 
very concise proof that 7 is irrational [224]. 


Let 7 = a/b, the quotient of positive integers. We define the polyno- 


mials 
x" (a — bx)” 


P@)=f@)-i/"@ +I" @)a + Cre" @), 


the positive integer n being specified later. Since n! f(x) has integral 
coefficients and terms in x of degree not less than n, f(x) and its 
derivatives f (x) have integral values for x = 0; also for 2 = 7 = 
a/b, since f(x) = f(a/b— 2x). By elementary calculus we have 


+ {F"(2) sinz — F(x) cosx} = F" (x) sing + F(x) sinz = f(x) sing 
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and 
[ f(x) sinadz = [F’(x) sinxz — F(x) cosz]j = F(x) + F(0). (3.66) 


Now F(m) + F(0) is an integer, since f)(0) and f(z) are integers. 
But for0<a<q7, 


TL yt 


0< f(x)sina < ar 
so that the integral in (3.66) is positive but arbitrarily small for n 
sufficiently large. Thus (3.66) is false, and so is our assumption that 
Tm is rational. 


This proof gives a good taste of the ingredients of more subtle irra- 
tionality and transcendence proofs. 


A spigot algorithm for e and pi. A spigot method for a numerical 
constant is one that can produce digits one by one (“drop by drop”) 
[239]. This is especially easy for e as carries are not a big issue. 


(a) The following algorithm, due to Rabinowitz and Wagon, gener- 
ates successive digits of e. Initialize an array A of length n+ 1 
to 1. Then repeat the following n — 1 times: (a) multiply each 
entry in A by ten; (b) Starting from the right, reduce the i-th 
entry of A modulo i+ 1, carrying the quotient of the division 
one place left. The final quotient produced is the next digit of 
e. This algorithm is based on the following formula, which is 
simply a restatement of e = )> 1/il. 


r= tet oob(oed(-H(Oe40-9))) 


(b) Implement a parallel spigot algorithm for 7, based on showing 
that: 


rane g (ae 5 (247 (2+ (tte) )}): 


The last term can be approximated by 2 + 4k/(2k + 1) where 
k = log,(10)n to produce n digits of a “drop by drop.” If one 
wishes to run the algorithm without a specified end, one must 
take more care. 
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One may view the iteration (4.7), which we will study in Chapter 4, 
as a spectacular (albeit unproven) spigot algorithm for 7 base 16. 


16. Wagon’s BBP identity. Determine the range of validity of the 
following identity, which is due to Stan Wagon: 


1—22-2? 
am + 4arctan z+ 2 log (=>) a 


zetl 


1+ 8k 4+ 8k 5+ 8k 6+ 8k 


9 1. (= 4 ree Q(z a ier (z 4 18h (z 4 | 
16 


17. Monte Carlo calculation of pi. Monte Carlo simulation was pio- 
neered during the Manhattan project by Stanislaw Ulam and others, 
who recognized that this scheme permitted simulations beyond the 
reach of conventional methods on the systems then available. We il- 
lustrate here a Monte Carlo calculation of 7, which is a poor method 
to compute 7, but illustrative of this general class of computation. 
Nowadays, Monte Carlo methods are quite popular because they are 
well suited to parallel computation on systems such as “Beowulf” 
clusters. 


(a) Design and implement a Monte Carlo simulation for 7, based 


wa 


on generating pairs of uniformly distributed numbers in the unit 
square and testing whether they lie inside the unit circle. Use 
the pseudorandom number generator zo = 314159 and rz, = 
c£,_1 mod 237, where c = 59 = 1953125. This generator is 
of the well known class of linear congruential generators and 
has period 2°° [188, pg. 21]. It can be easily implemented on a 
computer using IEEE 64-bit “double” datatype, since the largest 
integer that can arise here is less than 2°°. Variations with longer 
periods can easily be designed, although the implementation is 
not as convenient. The results of this generator are normalized, 
by 2°? in this case, to produce results in the unit interval. 


Extend your program to run on a parallel computer system, with 
the property that your parallel program generates the same over- 
all scheme of pseudorandom numbers, and thus gets the same 
result for 7, as a serial implementation (you may for conve- 
nience assume that n, the total number of pseudorandom num- 
bers generated, is evenly divisible by p, the number of proces- 
sors). This is a very desirable feature of a parallel program, 
because it allows you to certify your parallel results by com- 
paring them with a conventional single-processor run, and it 
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permits you to take advantage of a range of system sizes. Hint: 
Design the program so that processor k (where processors are 
numbered from 0 to p — 1) generates the m = n/p members of 
the sequence (2pm, Ckm+1;Tkm+2)°'*; Tkm+m—1). Note that the 
starting value x, for processor k can be directly computed as 
Lkm = 5°*™axg mod 232. This exponentiation modulo 23? may 
be performed by using Algorithm 3.2, implemented with 128-bit 
floating-point or “double-double” arithmetic (see Section 6.2.1). 


Generate the following sequence of pseudorandom numbers and 
experimentally determine what distribution they satisfy (i.e., by 
computing means, standard deviations, graphs, etc.): Let 1 and 
x2 be a pair of uniform (0,1) pseudorandom numbers generated 


as described above. Set v = \/zj7 +23 and w = \/—2logu/v. 


Then produce the results yy = wa, and y2 = wx. 


18. Life of Pi. At the end of his story, Piscine (Pi) Molitor [215, pp.316— 


7] writes 


Iam a person who believes in form, in harmony of order. Where 
we can, we must give things a meaningful shape. For example—I 
wonder—could you tell my jumbled story in exactly one hundred 
chapters, not one more, not one less? I'll tell you, that’s one 
thing I hate about my nickname, the way that number runs on 
forever. It’s important in life to conclude things properly. Only 
then can you let go. 


We may not share the sentiment, but we should celebrate that Pi 
knows 7 to be irrational. 
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Anyone who wants to make a name for himself can examine the major 
issue of whether z is normal, or perhaps more accurately, whether 7 
is not normal. 


— Jérg Arndt and Christoph Haenel, Pi Unleashed, 2001 


In this chapter, we address a fundamental problem of mathematics, a para- 
dox of sorts: Whereas on one hand it can be proven that “almost all” real 
numbers are normal, and whereas it appears from experimental analysis 
that many of the fundamental constants of mathematics are normal to 
commonly used number bases, as yet there are no proofs, nor even any 
solid reason why we should observe this behavior. What we shall show 
here is that the theory of BBP constants, which as we have seen is a classic 
case study of experimental mathematics in action, opens a pathway into the 
investigation of normality, and in fact has already yielded some intriguing 
results. 


4.1. Normality: A Stubborn Question 


Given a real number a and an integer b > 2, we say that a is b-normal 
or normal base b if every sequence of k consecutive digits in the base-b 
expansion of a appears with limiting frequency b-*. In other words, if 
a constant is 10-normal, then the limiting frequency of “3” (or any other 
single digit) in its decimal expansion is 1/10, the limiting frequency of 
“58” (or any other two-digit pair) is 1/100, and so forth. We say that a 
real number a is absolutely normal if it is b-normal for all integers b > 1 
simultaneously. 

In spite of these strong conditions, it is well known from measure the- 
ory that the set of absolutely normal real numbers in the unit interval has 
measure one, or in other words, that almost all real numbers are abso- 
lutely normal (see Exercise 1 at the end of this chapter). Further, from 
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numerous analyses of computed digits, it appears that many of the fun- 
damental constants of mathematics are normal to commonly used number 
bases. By “fundamental constants,” we include 7, e, V2, the golden mean 
7 = (1+ V5)/2, as well as logn and the Riemann zeta function ¢(n) for 
positive integers n > 1, and many others. For example, it is a reasonable 
conjecture that every irrational algebraic number is absolutely normal, 
since there is no known example of an irrational algebraic number whose 
decimal expansion (or expansion in any other base) appears to have skewed 
digit-string frequencies. 

Decimal values are given for a variety of well known mathematical con- 
stants in Table 4.1 [99,142]. In addition to the widely recognized constants 
such as 7 and e, we have listed Catalan’s constant (G), Euler’s constant 
(y), an evaluation of the elliptic integral of the first kind K(1/V2), an eval- 
uation of an elliptic integral of the second kind E(1/V2), Feigenbaum’s a 
and 6 constants, Khintchine’s constant K, and Madelung’s constant M3. 
Binary values for some of these constants, as well as Chaitin’s Q constant 
(from the field of computational complexity) [99], are given in Table 4.2. 
As you can see, none of the expansions in either table exhibits any evident 
“pattern.” 

The digits of 7 have been studied more than any other single constant, 
in part because of the widespread fascination with 7. Along this line, 
Yasumasa Kanada of the University of Tokyo has tabulated the number 
of occurrences of the ten decimal digits “0” through “9” in the first one 
trillion decimal digits of 7. These counts are shown in Table 4.3. For 
reasons given in Section 3.4, binary (or hexadecimal) digits of 7 are also of 
considerable interest. To that end, Kanada has also tabulated the number 
of occurrences of the 16 hexadecimal digits “0” through “F,” as they appear 
in the first one trillion hexadecimal digits. These counts are shown in Table 
4.4, As you can see, both the decimal and hexadecimal single-digit counts 
are entirely reasonable. 

Some readers may be amused by the LBNL PiSearch utility, which is 
available at http://pisearch.lbl.gov. This online tool permits one to enter 
one’s name (or any other modest-length alphabetic string, or any modest- 
length hexadecimal string) and see if it appears encoded in the first several 
billion binary digits of 7. Along this line, a graphic based on a random 
walk of the first million decimal digits of 7, courtesy of David and Gregory 
Chudnovsky, is shown in Figure 4.1 (see Color Plate VIII). It maps the 
digit stream to a surface in ways similar to those used by Mandelbrot and 
others. 

As we mentioned in Section 3.2, the question of whether 7, in particular, 
or, say, V2, is normal or not has intrigued mathematicians for centuries. 
But in spite of centuries of effort, not a single one of the fundamental 
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Constant Value 

V2 1.4142135623730950488. .. 
V3 1.7320508075688772935. .. 
V5 2.2360679774997896964. .. 

o= v1 0.61803398874989484820. .. 
m  3.1415926535897932384... 

1/m 0.31830988618379067153... 
e€  2.7182818284590452353... 

1/e 0.36787944117144232159... 
e™  23.140692632779269005... 

log2 0.69314718055994530941 ... 
log10 2.3025850929940456840... 
logy 10 3.3219280948873623478... 

logi9 2 0.30102999566398119521... 
log,3 — 1.5849625007211561814... 
C(2) 1.6449340668482264364... 
¢(3) 1.2020569031595942854... 
¢(5) 1.0369277551433699263... 

G  0.91596559417721901505... 

y  0.57721566490153286060... 
T(1/2) = Vm = 1.7724538509055160272... 
T(1/3) 2.6789385347077476336 ... 
T(1/4) 3.6256099082219083119... 
K(1/V2) 1.8540746773013719184... 
E(1/V2) 1.3506438810476755025... 
af  4.6692016091029906718... 
Of 2.5029078750958928222... 
K 2.6854520010653064453... 
Mz 1.7475645946331821903... 


Table 4.1. Decimal values of various mathematical constants. 


Constant Value 


am 11.001001000011111101101010100010001000010110100011000010001 ... 

e 10.101101111110000101010001011000101000101011101101001010100... 
V2 1.0110101000001001111001100110011111110011101111001100100100 ... 
V3 1.1011101101100111101011101000010110000100110010101010011100... 
log 2 0.1011000101110010000101111111011111010001110011110111100110... 
log 3 1.0001100100111110101001111010101011010000001100001010100101 ... 
QQ 0.0000001000000100001000001000011101110011001001111000100100 ... 


Table 4.2. Binary values of various mathematical constants. 
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Digit 
0 
1 
2 
3 
4 
5 
6 
is 
8 
9 


Total 


Occurrences 
99999485134 
99999945664 

100000480057 
99999787805 
100000357857 
99999671008 
99999807503 
99999818723 
100000791469 
99999854780 


1000000000000 


4. Normality of Numbers 


Table 4.3. Statistics for the first trillion decimal digits of 7. 


Digit 


PAD QAWEFEOANDoRwNnNr oO 


a 


Tota 


Occurrences 
62499881108 
62500212206 
62499924780 
62500188844 
62499807368 
62500007205 
62499925426 
62499878794 
62500216752 
62500120671 
62500266095 
62499955595 
62500188610 
62499613666 
62499875079 
62499937801 


1000000000000 


Table 4.4. Statistics for the first trillion hexadecimal digits of 7. 
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Figure 4.1. A random walk based on one million digits of 7 (see Color Plate VIII). 
Courtesy of David and Gregory Chudnovsky. 


constants of mathematics has ever been proven to be b-normal for any 
integer b, much less for all integer bases simultaneously. And this is not for 
lack of trying—some very good mathematicians have seriously investigated 
this problem, but to no avail. Even much weaker results, such as the digit 
“1” appears with nonzero limiting frequency in the binary expansion of 7, 
and the digit “5” appears infinitely often in the decimal expansion of V2, 
have heretofore remained beyond the reach of modern mathematics. 

One result in this area is the following. Let f(m) = 11 <j<,[logio JJ. 
Then the Champernowne number, 


CO 


Ss peo = 0.12345678910111213141516171819202122232425 ..., 
n=1 


where the positive integers are concatenated in a decimal value, is known to 
be 10-normal (See Exercise 5). There are similar constants and normality 
results for other number bases. However, no one, to the authors’ knowl- 
edge, has ever argued that this constant and its relatives are “natural” or 
“fundamental” constants. 

Consequences of a proof in this area would definitely be interesting. For 
starters, such a proof would immediately provide an inexhaustible source 
of provably reliable pseudorandom numbers for numerical or scientific ex- 
perimentation. We also would obtain the mind-boggling but uncontestable 
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consequence that if 7, for example, is shown to be 2-normal, then the en- 
tire text of the Bible, the Koran and the works of William Shakespeare, 
as well as the full ATX source text for this book, must all be contained 
somewhere in the binary expansion of 7, where consecutive blocks of eight 
bits (two hexadecimal digits) each represent one ASCII character. Unfortu- 
nately, this would not be much help to librarians or archivists, since every 
conceivable misprint of each of these books would also be contained in the 
binary digits of 7. 

Before continuing, we should mention the “first digit” principle, also 
known as Benford’s principle. In the 1880s, Simon Newcomb observed a 
pattern in the first digits of logarithm tables: A “1” is significantly more 
likely to occur than “2,” a “2” more than a “3,” and so on. In other words, 
the collection of first digits of data in logarithm tables certainly does not 
reflect the statistics expected of 10-normal numbers. In the 20th century, 
Frank Benford rediscovered this phenomenon, noting that it applies to 
many types of numerical data, ranging from values of physical constants to 
census data to the stock market. One can deduce this principle by observing 
that natural laws surely cannot be dependent on our choice of units, and 
thus must be scale-independent. This suggests that we view numerical data 
on a logarithmic scale. In the logarithmic sense, a leading “1” appears 
roughly 30% of the time (since log; 2 — logig 1 = 0.30102999...), a “2” 
appears roughly 17.6% of the time (since log;) 3 — log;) 2 = 0.1760912...), 
and so on. More recently scientists have applied Benford’s principle in 
diverse ways, including fraud detection in business accounting [172]. 


4.2 BBP Constants and Normality 


Until recently, the BBP formulas mentioned in Sections 3.4 and 3.6 were 
assigned by some to the realm of “recreational” mathematics—interesting 
but of no serious consequence. But the history of mathematics has seen 
many instances where results once thought to be idle curiosities were later 
found to have significant consequences. This now appears to be the case 
with the theory of BBP-type constants. 

What we shall establish below, in a nutshell, is that the 16-normality 
of (which, of course, is equivalent to the 2-normality of 7), as well as the 
normality of numerous other irrational constants that possess BBP-type 
formulas, can be reduced to a certain plausible conjecture in the theory 
of chaotic sequences. At this time we do not know the full implications 
of this result. It may be the first salvo in the resolution of this age-old 
mathematical question, or it may be merely a case of reducing one very 
difficult mathematical problem to another. But at the least, this result 
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Paper 18: Stephen K. Lucas, “Approximations to 7 derived from integrals with 
nonnegative integrands,” American Mathematical Monthly, vol. 116 (2009), p. 166— 
172. Copyright 2009 Mathematical Association of America. All Rights Reserved. 


Synopsis: 

In this article, Stephen Lucas addresses the classic grade school approximation 
a = 22/7. He shows how this venerable approximation can be seen in context with 
approximations based on integrals, such as 


1 4/1 _ »\4 
: x*(1— 2) de = 22 . 
9 l+2? 7 
Lucas then exhibits other integrals that yield even better approximations, such as 
iP (1 — 7)8(25 + 81627) | 355, 
0 


t= TT. 


3164(1 + 2?) 113 
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NOTES 
Edited by Ed Scheinerman 


Approximations to 7 Derived from 
Integrals with Nonnegative Integrands 


Stephen K. Lucas 


One of the more intriguing results related to approximating 7 is the integral 


'y4(1 — x)! py: 
0 1+ x? 7 


Since the integrand is nonnegative and “small” (as we shal] shortly see) on the interval 
[0, 1], it shows that wz is smaller than 22/7, and that this rational approximation is 
inded a good approximation to it. 

The earliest statement of (1) that we are aware of is Dalzell [6] in 1944. Proving 
(1) was a question on a University of Sydney examination in November 1960 (Bor- 
wein et al. [4]), and it was apparently shown by Kurt Mahler to his students in the mid 
1960s. Proving (1) was also the first question in the William Lowell Putnam mathe- 
matical competition of December 1968, as published by McKay {10] in 1969. In 1971, 
Dalzell [7] again derived (1) in a larger work published in the Cambridge student jour- 
nal Eureka. This paper is the one most often cited in connection with the result (1) 
(e.g., Backhouse [2] and Borwein et al. [4]). It was also presented without reference in 
Comwell [5] in 1980. A more recent reference is Medina [11]. 

In this article we shall look at some features of this integral, including error bounds 
and a related series expansion. Then, we present a number of generalizations, including 
a new series approximation to 7 where each term adds as many digits of accuracy 
as desired. We conclude by presenting a number of related integral results for other 
continued fraction convergents of 7. 


1. THE CLASSIC INTEGRAL. Proving (1) is not difficult, if perhaps somewhat 
tedious. A partial fraction decomposition leads to 


x4(1—x)* 4 
EES 58 Sh aA id a. : 
14+ .x? 1+ x? 


(2) 


and (1) immediately follows by integration. An alternative is to use the substitution 
x = tan@, leading to 


f x*(1 — x4 fr tan* (1 — tan@)* 
Sy aaa Sen aN 
0 0 


2 
= 6 dd 
1+x? sec? 6 me 


m/4 7 
=f tan’ é — 4tan? 6 + 6tan® 6 — 4tan’ 6 + tan’ 6 dé. 
9] 


This can be solved using the recurrence relation (tan” @ = tan”~* @ (sec? @ — 1)) 


fd 1 xf/4 
i tan"6 dé = — : tan"-* 6 dé, 
0 


0 n—1| 
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with 
aja a x/4 
| do = —, [ tan@ dé = Inv2, 
9 4 0 


which returns the required result after some algebra. Of course, the simplest approach 
today is to verify (1) using a symbolic manipulation package. 


1.1. Error estimation. As well as showing 22/7 > a we can use (1) to get bounds on 
the error. One approach, following Nield [12], is to note that since x(1 — x) < 1/4 and 
1 +x? > Lon [0, 1] with equality only at the endpoints, the integrand takes maximum 
value (1/4)* = 1/256. Combined with the fact that 22/7 — z is positive, we get the 
error bound 


5625 22 ] 22 

1792 7 += 256 qe 
However, a better bound can be found by noting that | < 1+ x? <2for0<x <1, 
and ie x*(1 — x)4 dx = 1/630 (as in Dalzell (6, 7] and Nield [13]). Then 


1 22 1 1979-22 1 22 1 3959 
—~—- < —-a< = <a < a 


Te OF me Ee KT = = : 
1260 7 630 630 7 630 7 1260 1260 


The interval [1979/630, 3959/1260] is of width 7.94 x 10~+, and is not centered at 7. 


1.2. Series expansion. Dalzell [6] also provides a series expansion for z based upon 
(2), which is included in Borwein et al. [4] as an example. After gathering the two 
pieces with 1 + x? as the denominator, we can write 


1 x6 — 4x9 + 5x4 — 4x? 4.4 4 x§ ~ 4x9 + 5x4 — 4x7 +4 
—_————_- or = 
14+x%° 44+ x4(1—x)4 1+ x? 14+ x41 — x)4/4 


Integrating both sides between 0 and 1 and using the Taylor series expansion for 
1/C1 + 1) leads to 


0 ] kK pl ; ; 
r= Ss (-3) i (x® — 4x9 + 5x4 — 4x? + 4x" — x)* dx. (3) 


k=0 


Applying integration by parts n times reducing the exponent of (1 — x), we have 


min! 
my "dx= Se 4 
fv ssi er ea ®) 

where m and n are nonnegative integers, which when applied to (3) gives 
= 3 ( ] ) (4k)! (4k +6)! 4(4K)' (4k +5)! 54K)! (4k + 4)! 
ANS (8k +7)! (8k + 6)! (8k +5)! 
A(AK)I(4k +2)! 4 (4k)? 6) 
(8k + 3)! (8kK+D!] 
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The series (5) is equivalent to that derived in Dalzell [6, 7] and Borwein ct al. [4], but 
written in a different form. There the sixth order polynomial in (3) is recognized to 
be unchanged when x is replaced by 1 — x, and so additional algebra is performed to 
reformulate the integral in (3) as 


1 
| (3420 —x«x)- nae — xy — (1 -»'] xa — x) dx 
0] 


— 34K? (4k + DP _ (4k + 2)? a (4k +3)P 6) 
~ (BkK+ 1) ° (8K+3)! 28K +5) (BRET) 


where we have applied (4) with m =n several times. To show that the bracketed ex- 
pression in (5) is equivalent to (6), it is easiest to factor both of them, and incidentally 
get a cleaner solution. In both cases, start by giving them the common denominator 
(8k + 7)!, and take out the common factor (4k)!*. It quickly becomes apparent that 
(4k + 1)(4k + 2)(4k 4+ 3) is also a common factor, and both expressions lead to 


(820k? + 1533k" + 902k + 165) 


k=0 


(8k +7)! 
a) 543 77 

i eee seen Oe See 7) 
7 15015 | 594914320 104187267600 


The convergence rate can be found by applying Stirling’s approximation for the tacto- 
rials (see for example [1, 6.1.37]) and taking the ratios of successive terms, giving that 
each term has magnitude roughly 1/1024 of the previous term, or roughly 3 decimal 
digits of accuracy are added per term. Using just the first two terms, and knowing that 
the error when truncating an alternating series with terms decreasing in magnitude is 
less than or equal to the absolute value of the first term in the truncated part, we can 
form the bound 


22 19 22 19 
< < 


7 ~ T5018 =" = 77 T5005" 


(8) 


which is of width 2.53 x 10~*. This result is poorer than the bound from the previous 
section, but if we use three terms, then 


22 19 543 22 19 343 
LS + : (9) 
7 15015 594914320 7 15015 = 594914320 


which is a bound of width 1.83 x 107°, an improvement. 


2. RELATED FAMILIES OF INTEGRALS. There turn out to be a number of fam- 
ilies of integrals that are similar in style to (1). The most obvious is originally duc to 
Nield [12] in 1982, who introduces 


a oad — x)" 
i= —————d 10 
4 [ eee x (10) 


for positive integers n. Then / from (1) is equivalent to 74. Medina [11] has investigated 


this set of integrals in detail, where the upper bound in the integral has becn replaced by 
x, and he has used these integrals to develop polynomial approximations to arctan(x) 
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with rational coefficients. From our perspective, one of the most useful results he gives 
is the closed form expression 


x7 = x)" : . n-t! ; ; (—4)" 
=(x 4 5 5) 4 Ay? 4 —A4 n—1|--k 4k Pe’ 4k . 
rece (x x” + 5x x pe ) x"(1— x) oT 93 
(11) 


While this equation is not proven explicitly in [11], a proof by mathematical induction 
is straightforward. Integrating (11) using (4) and simplifying leads to 


dx 


(—-1)" [ x" - x)? 
Ar-l 0 }+ x2 


(820k* + 1533k? + 902k + 165), 


nal 4—2k It ! 
=— yi 12 (4k)! (4k 4 3)! 
ra (8k + 7)! 
where the integration and simplification were already done earlier for Dalzell’s series 
expansion. So in fact the closed form expression for (—1)" La, /4'—" is equivalent to the 
error when approximating z by a truncated version of Dalzell’s series expansion! 
Tn 1995, Backhouse [2} generalized (1) to 


Loum ] = n 
n= / EEN ye oe SOS: (12) 
o L+x? 


where a, b, and c are rationals that depend on the positive integers m and n, and a and 
b have opposite sign. In this case, / = f44. Backhouse [2] showed that if 2m —n = 0 
(mod 4), then c = 0 and approximations to 7 result. In what follows we shall assume 
that this is the case. An integral equal to a + ba gives the approximation —a/b for z. 
As m and n increase, the integrand becomes increasingly flat (Backhouse calls them 
“pancake functions”) and the approximations to 7 improve as well. Unfortunately, 
there is no straightforward formula relating a and / directly to m and n as in the Uy, 
case. However, Weisstein [14] at least states the result 


Tina _ 2D eT (im + DrPH + 1) 
m+] m+2 mt+n+2 mtn+3_ \ 
ee De € Ge 2 , 2 , ‘ 


We previously saw that error bounds for / could be found using the bounds 1 < 
1 +x? <2. The same approach for J,,,, directly leads to 


min! 'y"(1— xy" mint 
————— <atbr= —_——— dx < -———__., 
2(m +n+ 1)! 0 14+x (m+n-+ 1)! 


where a and arc the rationals depending on m and n. As m and n increase, the bounds 
on the error decrease with reasonable rapidity. 


2.1. Series expansion. Given the closed form expression (11), we can follow the 
same process as Dalzell to produce series expansions for 7, with a specific value of a 
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leading to 
oo ant 1 
t= ss (ayant f (x® Ls 4x° zi 5x4 ts Ax? ae A) xin) (1 co x) erem dx, 
m=0 k=0 0 


(13) 
which generalizes (3). Evaluating the integrals as before leads to 


nal 


2 Oe “42-8 ee a (8200? + 153302 + 9020 + 165), (14) 


k=0 


where a9 = k + 2m. With n = 1, (14) is exactly Dalzell’s expansion (7). With n = 2 
we have 


Oo 
i en + 3)! 
_ 2-2m 3 2 


m0 


eae 
1640m? + 3993m? + 3214m + 855 
jones een Sa ] 
47171 16553 64615651 


~ 15015 | 18150270600 | 102659859353904652800 


Note that this is not an alternating series, and each term is roughly 1/2”° of the previ- 
ous, or roughly 6 digits of accuracy are added per term. 

There is no reason why we can’t take 7 as large as we like. While this increases the 
amount of work to find each term in the series, each term is roughly 1/2'™ the size 
of the previous term, or roughly 3x digits of accuracy are added with each term. In 
principle, there is no reason why a series where each term adds one hundred digits or 
more of accuracy cannot be explicitly written down from (14) with n > 33. 


3. INTEGRALS LEADING TO CONVERGENTS. The main reason for appreci- 
ating the elegance of (1) surely is that it approximates 7 by 22/7, the classic and most 
well-known rational approximation both within and outside the mathematics commu- 
nity. The number 22/7 is particularly good because it is better than other rational 
approximation p/g for g < 57. In fact it is one of the convergents of the continued 
fraction approximation to z, the first few of which are 3, 22/7, 333/106, 355/113, 
103993 /33102, and 104348/33215. There are many excellent texts on continued frac- 
tions, including Chapter 10 of Hardy and Wright [8]. A natural question, then, is 
whether there are integrals similar to the ones shown here that lead to other convergents 
of zr. 

Unfortunately, none of the integrals considered so far lead to approximations to z 
related to the other convergents of 2. The 22/7 in (1) must be considered a happy coin- 
cidence. However, Lucas [9] developed a set of integrals with nonnegative integrands 
that equalled 355/113 — w, where 355/113 is the next particularly good approxima- 
tion to 7. Here, we generalize those results, and show how integrals with nonnegative 
integrands can be formed for z — m (if z > 7) orm — z (if z < 7), with any real z. 

We begin by noting that /,,., from (12) is a combination of multiples of 1, 7, and 
In 2. Now consider the related integral 

lL ym a 2 
es) x" — x) SPORE) (15) 
: 0 1+ x? 
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which can be evaluated as a combination of 1, 7, and In2, where the coefficients 
depend on a, b, and c. [f we want I’ to equal z — z, this leads to a set of three linear 
equations in a, 6, and c, which is easily solved. However, there is no guarantee that 
our resulting integrand is nonnegative. We need to ensure m and n are large enough 
that a + bx + cx? > 0 for x € [0, 1]. The closer z is to 7, the larger m and v will need 
to be. As m and n increase, the coefficients a, b, and c become increasingly large, and 
so a “best” solution can be found, in the sense that the number of characters required 
to form the integrand is minimal. Using Maple code (available from the author) to list 
the various solutions verifies that (1) is the simplest integrand leading to 22/7, and that 
the simplest results for other continued fraction approximations to 7 are 


i x9(1 — x)9(197 + 462x7) pase. 333 
0 530(1 + x2) ite 106° 
[ x¥(1 = x)'(25 + 816x") | 355 
P 3164(1 + x?) as en 
[ x41 — x)!2(124360 + 77159x2) F 103993 
SSE SE, 
; 755216(1 + x?) 33102 
and 
[ x!2(1 ~ x)'2(1349 — 1060x?) 104348 
= S70 
0 38544(1 + x2) “= 33215 


To conclude, there are a variety of further directions for experimentation. Are there 
other integrals with nonnegative integrands leading to approximations of 7 apart from 
(15) that are worth considering? Also, are there other constants that can be approxi- 
mated, and hence new series approximations developed using these techniques? For 
example, (15) could be used to find approximations to In 2. 
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New Proofs of Euclid’s and Euler’s Theorems 


Juan Pablo Pinasco 


In this note we give a new proot of the existence of infinitely many prime numbers. 
There are several different proofs with many variants, and some of them can be found 
in (1,3, 4, 5, 6]. This proof is based on a simple counting argument using the inclusion- 
exclusion principle combined with an explicit formula. A different proof bascd on 
counting arguments is due to Thue (1897) and can be found in [6] together with sev- 
eral generalizations, and a remarkable variant of it was given by Chaitin [2] using al- 
gorithmic information theory. Moreover, we prove that the series of reciprocals of the 
primes diverges. Our proofs arisc from a connection between the inclusion-exclusion 
principle and the infinite product of Euler. 

Let {p;}; be the sequence of prime numbers, and let us define the following recur- 
rence: 


1—- ak 
a = 0, Apt =A + ‘ 
Pr+i 


Let us note that the Nth term ay generated by this recurrence coincides with 


a= oet DD ei, 


jf PIP) SL PiPIPR Pi-++ Py 


and can be given in a closed form as 


which implies that 0 < ay < I, since each factor is strictly positive and Iess than one. 
Now, we are ready to prove the classical Euclid’s theorem: 


Theorem 1. There are infinitely many prime numbers. 


Proof, Let us suppose that pj < po <--+ < pw are all the primes. For any x > 1, 
and fori = 1,...,N, let A; be the set of integers in [1, x] that are divisible by );. 
Then, the number of positive integers in [1, x] is obtained by applying the inclusion- 
exclusion formula to find the cardinality of U®_, A;: 


i=l 


x x x x 
EE) Ebel Zbl lead 
2 Pi > PiP; a Pi Pj Pk P\--+ PN 
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“Ramanujan’s series for 1/7: A survey,” American Mathematical Monthly, vol. 116 
(2009), p. 567-587. Copyright 2009 Mathematical Association of America. All 
Rights Reserved. 


Synopsis: 
In this piece, the authors discuss some formulas for 1/7 originally discovered 
by Ramanujan. One of these is the formula 


1 v8 s (4n)!(1103 + 26390n) 
x 9801 (n!)43964" 


n=0 
which was used by Gosper in 1985 to compute 7 to compute 17,526,100 decimal dig- 
its of 7. Using the same mathematical approach, David and Gregory Chudnovsky 
subsequently deduced the formula 


Co 


i py (—1)"(6n)! 13591409 + 545140134n 
— (nl)8(Bn)! 6403203"+3/2 


which was then used by them to compute 2,260,331,336 digits. Results are presented 
here for numerous other series of this general form. 


Keywords: Computation, History, Series 
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Ramanujan’s Series for 1/7: A Survey* 


Nayandeep Deka Baruah, Bruce C. Berndt, and Heng Huat Chan 


In Memory of V. Ramaswamy Aiyer, 
Founder of the Indian Mathematical Society in 1907 


When we pause to reflect on Ramanujan’s life, we see that there were certain 
events that seemingly were necessary in order that Ramanujan and his mathemat- 
ics be brought to posterity. One of these was V. Ramaswamy Aiyer's founding 
of the Indian Mathematical Society on 4 April 1907, for had he not launched the 
Indian Mathematical Society, then the next necessary episode, namely, Ramanu- 
jan’s meeting with Ramaswamy Aiyer at his office in Tirtukkoilur in 1910, would 
also have not taken place. Ramanujan had carried with him one of his notebooks, 
and Ramaswamy Aiyer not only recognized the creative spirit that produced its 
contents, but he also had the wisdom to contact others, such as R. Ramachandra 
Rao, in order to bring Ramanujan’s mathematics to others for appreciation and 
support. The large mathematical community that has thrived on Ramanyjan’s 
discoveries for nearly a century owes a huge debt to V. Ramaswamy Aiyer. 


1. THE BEGINNING. Toward the end of the first paper [57], [58, p. 36] that 
Ramanujan published in England, at the beginning of Section 13, he writes, “I shall 
conclude this paper by giving a few series for 1/m.” (In fact, Ramanujan concluded 
his paper a couple of pages later with another topic: formulas and approximations for 
the perimeter of an ellipse.) After sketching his ideas, which we examine in detail 
in Sections 3 and 9, Ramanujan records three series representations for 1/7. As is 
customary, set 


(a)o = 1, (4), >= a(a4t1)---(atn—-D, n> 1. 


An ‘= ae n>0. (1.1) 


Theorem 1.1. If A, is defined by (1.1), then 


a, 1 
— =) \(6n+ Az, (1.2) 
eer 4r 

2 1 
18 (420 + Anse, (3) 
xf 26" 


5 (1.4) 


aot ae es oe 
52.= 9° (aavi-+20045V5—1) Auge ( ) 


*This paper was originally solicited by the Editor of Mathematics Student to commemorate the founding of 
the Indian Mathematical Society in its centennial year. Mathematics Student is one of the two official journals 
published by the Indian Mathematical Society, with the other being the Journal of the Indian Mathematical 
Society. The authors thank the Editor of Mathematics Student for permission to reprint the article in this 
MONTHLY with minor changes from the original, 
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The first two formulas, (1.2) and (1.3), appeared in the Walt Disney film High 
School Musical, starring Vanessa Anne Hudgens, who plays an exceptionally bright 
high school student named Gabriella Montez. Gabriella points out to her teacher that 
she had incorrectly written the left-hand side of (1.3) as 8/z instead of 16/z on the 
blackboard. After first claiming that Gabriella is wrong, her teacher checks (possibly 
Ramanujan’s Collected Papers?) and admits that Gabriella is correct. Formula (1.2) 
was correctly recorded on the blackboard. 

After offering the three formulas for 1/2 given above, at the beginning of Section 
14 [57], [58, p. 37], Ramanujan claims, “There are corresponding theories in which ¢ 
is replaced by one or other of the functions” 


oF; (4, =; 1; 1-2) 
qr i= Gr {X) s= xp | —7 cl Cr a ; (1.5) 
where r = 3, 4, or 6, and where 27, denotes one of the hypergeometric functions 
pF p-1, p = 1, which are defined by 


(21) eae (@p)n x” 


——  ,” 1. 
(in ious (Bp-1)n n! a 7 


pF p-1 (aj.44. 503 DigcecsBpSite) =>: 


n=0 


(The meaning of g is explained in Section 3.) Ramanujan then offers 14 further series 
representations for 1/2. Of these, 10 belong to the quartic theory, i.e., for r = 4; 2 
belong to the cubic theory, i.e., for r = 3; and 2 belong to the sextic theory, i.e., for 
r = 6. Ramanujan never returned to the “corresponding theories” in his published 
papers, but six pages in his second notebook [59] are devoted to developing these 
theories, with all of the results on these six pages being proved in a paper [16] by 
Berndt, S. Bhargava, and F. G. Garvan. That the classical hypergeometric function 
2Fi( 3, 5 1; x) in the classical theory of elliptic functions could be replaced by one of 
the three hypergeometric functions above and concomitant theories developed is one of 
the many incredibly ingenious and useful ideas bequeathed to us by Ramanujan. The 
development of these theories is far from easy and is an active area of contemporary 
research, 

All 17 series for 1/1 were discovered by Ramanujan in India before he arrived 
in England, for they can be found in his notebooks [59], which were written prior to 
Ramanujan’s departure for England. In particular, (1.2), (1.3), and (1.4) can be found 
on page 355 in his second notebook and the remaining 14 series are found in his third 
notebook [59, p. 378]; see also [14, pp. 352-354]. It is interesting that (1.2), (1.3), 
and (1.4) are also located on a page published with Ramanujan’s tost notebook [60, 
p. 370); see also [3, Chapter 15). 


2. THE MAIN ACTORS FOLLOWING IN THE FOOTSTEPS OF RAMANU- 
JAN. Fourteen years after the publication of [57], the first mathematician to address 
Ramanujan’s formulas was Sarvadaman Chowla [37], [38], [39, pp. 87-91, 116-119], 
who gave the first published proof of a general series representation for 1/7 and used 
it to derive (1.2) of Ramanujan’s series for t/m [57, Eq. (28)]. We briefly discuss 
Chowla’s ideas in Section 4. 

Ramanujan’s series were then forgotten by the mathematical community until 
November, 1985, when R. William Gosper, Jr. used one of Ramanujan’s series, namely, 


9801 3 (4n)! (1103 + 26390n) 


Se \ oe eae 2.1 
rV8 <x (nt396" ro 
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to calculate 17,526,100 digits of 2, which at that time was a world record. There was 
only one problem with his calculation—(2.1} had not yet been proved. However, a 
comparison of Gosper’s calculation of the digits of 1 with the previous world record 
held by Y. Kanada made it extremely unlikely that (2.1) was incorrect. 

In 1987, Jonathan and Peter Borwein [23] succeeded in proving all 17 of Ramanu- 
jan’s series for 1/7. In a subsequent series of papers [24], [25], [29], they established 
several further series for 1/7, with one of their series [29] yielding roughly fifty dig- 
its of x per term. The Borweins were also keen on calculating the digits of 7, and 
accounts of their work can be found in [30], [28], and [26]. 

At about the same time as the Borweins were devising their proofs, David and Gre- 
gory Chudnovsky [40] also derived series representations for 1/2 and, in particular, 
used their series 


1 433914 4514 
= 2D St (6n)! 3591409 + 545140134n 


n!)3 Gn)! (640320)3"+3/2 (2.2) 


to calculate a world record 2,260,331,336 digits of 2. The series (2.2) yields 14 digits 
of z per term. A popular account of the Chudnovskys’ calculations can be found in a 
paper written for The New Yorker [56]. 

The third author of the present paper and his coauthors (Berndt, S. H. Chan, A. Gee, 
W.-C. Liaw, Z.-G. Liu, V. Tan, and H. Verrill) in a series of papers [19], [31], [33], [34], 
[36] extended the ideas of the Borweins, in particular, without using Clausen’s formula 
in [31] and [36], and derived general hypergeometric-like formulas for 1/7. We devote 
Section 8 of our survey to discussing some of their results. 

Stimulated by the work and suggestions of the third author, the first two authors 
[9], [7] systematically returned to Ramanujan’s development in [57] and employed 
his ideas in order not only to prove most of Ramanujan’s original representations for 
1/2 but also to establish a plethora of new such identities as well. In another paper 
(8], motivated by the work of Jesus Guillera [48]-(53], who both experimentally and 
rigorously discovered many new series for both 1/s and 1/7, the first two authors 
continued to follow Ramanujan’s ideas and devised series Tepresentations for 1/1. 

In the survey which follows, we delineate the main ideas in Sections 3, 6, 7, 8, 
and 9, where the ideas of Ramanujan, the Borwein brothers, the Chudnovsky brothers, 
Chan and his coauthors, and the present authors, respectively, are discussed. 


3. RAMANUJAN’S IDEAS. To describe Ramanujan’s ideas, we need several defi- 
nitions from the classical theory of elliptic functions, which, in fact, we use throughout 
the paper. The complete elliptic integral of the first kind is defined by 


= ROS a Pee ee (3.1) 
0 


1—k?sin* g 


where k, 0 < k < 1, denotes the modulus. Furthermore, K’ := K (k’), where k’ := 
J/1 — k? is the complementary modulus. The complete elliptic integral of the second 
kind is defined by 


n{2 
E:= E(k) := [ yl —k sin’ g dg. (3.2) 
0 
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If g = exp(—1 K'/K), then one of the central theorems in the theory of elliptic 
functions asserts that [13, p. 101, Entry 6] 


y°(q) = kw =F! a (3.3) 


where g(q) in Ramanujan’s notation (or #3(g) in the classical notation) denotes the 
classical theta function defined by 


¢(q) = Da qi. (3.4) 


Note that, in the notation (1.5), g = g, and x = k*. The second equality in (3.3) fol- 
lows from expanding the integrand in a binomial series and integrating termwise. Con- 
versely, it is also valuable to regard k as a function of g, and so we write k = k(q). 
Let K, K’, L, and L’ denote complete elliptic integrals of the first kind associated 
with the moduli k, k’, 2, and £’, respectively. Suppose that, for some positive integer 7, 


K’ so 

or ae ie (3.5) 
A modular equation of degree n is an equation involving k and £ that is induced by 
(3.5). Modular equations are always algebraic equations. An example of a modular 
equation of degree 7 may be found later in (9.18). Alternatively, by (3.3), (3.5) can 
be expressed in terms of hypergeometric functions. We often say that @ has degree n 
over k. Derivations of modular equations ultimately rest on (3.3). If we set K’/K = 
/n, so that g = e~"¥", then the corresponding value of k, which is denoted by &, := 
k(e7™¥*), is called the singular modulus. The multiplier m = m(q) is defined by 


(3.6) 


Fi (4,451; 2 
aes m(q) —_ 2f) (3 : (q)) 
2 


2F, (4, 431; €(@)) 


We note here that, by (3.3), (3.6), and [13, Entry 3, p. 98; Entry 25(vii), p. 40], m(q) 
and k?(q) can be represented by 


2 47,2 
v*(q) a reat 


ma a ~ OT OA(g) 


’ 


respectively, where o(q) is defined by (3.4) and 
via) = do gisrn?. 


Thus, modular equations can also be written as theta function identities. 
Ramanujan begins Section 13 of [57] with a special case of Clausen’s formula [23, 
p. 178, Proposition 5.6(b)], 


os = 2 1 RS f 
“Sa ek = Fi (S 4 1451, 1; Qkk?), G.7) 
= ( 


570 © THE MATHEMATICAL ASSOCIATION OF AMERICA [Monthly 116 


19. RAMANUJAN’S SERIES FOR 1/7: A SURVI 


@ 


YY (2009) 


which can be found as Entry 13 of Chapter 11 in his second notebook [59] [12, p. 58]. 
Except for economizing notation, we now quote Ramanujan. “Hence we have 


qq; gt, = (je) > G); (2kk')2/ (3.8) 
s co 4 rar Gg! , . 
where 
(4: @)oo = (1 — a)(1 — ag){1 — aq’) +++ (3.9) 


Logarithmically differentiating both sides in (3.8) with respect to k, we can easily shew 
that 


oC 2) x 143 
JG 2 : (3); 1 2j 
1-24 - = (1 — 2k?) S°Gj + 12 (2kk')4. 3.10) 
ser, d 7G» 
But it follows from 
3 x j K 2 
i= -2#45)° ——— = [= 
aT Sear an (=) A(k) (3.11) 


j=l 


where A(k) is a certain type of algebraic number, that, when gq = ev"v" » being a 
rational number, the left-hand side of (3.10) can be expressed in the form 


2 
rears 
4 Xu 
where A and B are algebraic numbers expressible by surds. Combining (3.7) and 
(3.10) in such a way as to eliminate the term (2K /7r)?, we are left with a series for 
1/2.” He then gives the three examples (1.2)—-(1.4). 
Ramanujan’s ideas will be described in more detail in Section 9. However, in clos- 


ing this section, we note that the series on the left-hand sides of (3.10) and (3.11) is 
Ramanujan’s Eisenstein series P(q*), with g = e-*V" in the latter instance, where 


id 
a igi <1. (3.12) 


P(q):=1-24)> 

j=! 

Ramanujan’s derivation of (3.11) arises firstly from the transformation formula for 

P(q), which in turn is an easy consequence of the transformation formula for the 

Dedekind eta-function, given in (9.11) below. The second ingredient in deriving (3.11) 

is an identity for n P(q™") — P(q*) in terms of the moduli k and £, where £ has degree 

n over k. Formula (3.8) follows from a standard theorem in elliptic functions that 
Ramanujan also recorded in his notebooks [59], [13, p. 124, Entry 12(iii)]. 


4. SARVADAMAN CHOWLA. Chowla’s ideas reside in the classical theory of 
elliptic functions and are not unlike those that the Borweins employed several years 
later. We now briefly describe Chowla’s approach [37], [38], [39, pp. 87-91, 116— 
119]. Using classical formulas of Cayley and Legendre relating the complete elliptic 
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integrals K and E, defined by (3.1) and (3.2), respectively, he specializes them by 
setting K /K’ = ./n. He then defines 


co dy3 ; 
S.:= + if Beane’? (4.1) 
f=0 


and 


oo 1y2 
T= pa Qj 


id Dkk! 4.2 
2. Gye ) (4.2) 


Chowla then writes “Then it is known that, when k < 1/./2,” 


2K 4K? 
Sa: and. (4.3) 
oi 


2 
Chowla does not give his source for either formula, but the second formula in (4.3), as 
noted above, is a special case of Clausen’s formula (3.7). The first formula is a special 
case of Kummer’s quadratic transformation [23, pp. 179-180}, which was also known 
to Ramanujan. Each of the formulas of (4.3) is differentiated twice with respect to k, 
and, without giving details, Chowla concludes that if K/K’ = ./n, then 


1 
a Tr T), 
k a+ Blo + eh 
K 
— dT, + eh, 
1 
| 
7S a2Sp + 528, 
x 
1 
cies + b3Sy + €3S) + 3S, 
“where a,,;,... are algebraic numbers.” He then sets n = 3 and k = sin(z/12) in 


each of the four formulas above to deduce, in particular, identity (1.2) from the second 
formula above. 


5. R. WILLIAM GOSPER, JR. As we indicated in the Introduction, in November, 
1985, Gosper employed a lisp machine at Symbolics and Ramanujan’s series (2.1) to 
calculate 17,526,100 digits of 2, which at that time was a world record. (During the 
1980s and 1990s, Symbolics made a lisp-based workstation running an object-oriented 
programming environment. Unfortunately, the machines were too expensive for the 
needs of most customers, and the company went bankrupt before it could squeeze the 
architecture onto a chip.) Of the 17 series found by Ramanujan, this one converges 
the fastest, giving about 8 digits of 7 per term. At the time of Gosper’s calculation, 
the world record for digits of x was about 16 million digits calculated by Y. Kanada. 
Before the Borwein brothers had later found a “conventional” proof of Ramanujan’s 
series (2.1), they had shown that either (2.1) yields an exact formula for z or that it 
differs from x by more than 10-70. Thus, by demonstrating that his calculation of 
x agreed with that of Kanada, Gosper effectively had completed the Borweins’ first 
proof of (2.1). However, Gosper’s primary goal was not to eclipse Kanada’s record 
but to study the (simple) continued fraction expansion of x for which he calculated 
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17,001,303 terms. In email letters from February, 1992 and May, 1993, Gosper offered 
the following remarks on his calculations: 


Of course, what the scribblers always censor is that the digits were a by-product. 
I wanted to change the object of the game away from meaningless decimal digits. 

I used what I call a resumable matrix tower to exactly compute an enormous 
rational equal to the sum of a couple of million terms of Ramanujan’s 99-™ 
series. I then divided to form the binary integer = floor(2r2>®-90-™), | converted 
this to decimal and sent a summary to Kanada for comparison, and converted the 
binary fraction to a cf using an fft based scheme. 


Gosper’s world record was short lived, as in January, 1986, D. H. Bailey used an 
algorithm of the Borweins arising from a fourth-order modular equation to compute 
29,360,000 digits of 2. Gosper’s calculation of the continued fraction expansion of 
wt was motivated by the fact that many important mathematical constants do not have 
interesting decimal expansions but do have interesting continued fraction expansions. 
That continued fraction expansions are considerably more interesting than decimal ex- 
pansions is a view shared by the Chudnovsky brothers [43]. Continued fraction expan- 
sions can often be used to distinguish a constant from others, while decimal expansions 
likely will be unable to do so. For example, the simple continued fraction of e, namely, 


Pes ae ee ee ee 
CMO THD+T+1+44+14+1+64+14+-° 


has a pattern. On the contrary, taking a Jarge random string of digits of e would not 
help one identify e¢. It is an open problem if the simple continued fraction of z, namely, 


gh “hs a ES a ea) 
RS Sas ee a oP ee Pe ee es 
oe: at sed a ag a eae A aha a 
2474+ 74+24+24+24+24+14+844+2+1+4+1415 


1 1 1 1 1 1 
+34+13+14+44+2+76+4-:: 
has a pattern. 
Gosper also derived a hypergeometric-like series representation for 1, namely, 


—~ 507 — 6 
x=) 7 =, (5.1) 


which can be used to calculate any particular binary digit of 2. See a paper by 
G. Almkvist, C. Krattenthaler, and J. Petersson [1] for a proof of (5.1) as well as 
generalizations, which include the following theorem. 


Theorem 5.1. For each integer k > 1, there exists a polynomial S,(j) in j of degree 
4k with rational coefficients such that 


S&C) 


od Sarin ont 
78 (ay) OY 
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6. JONATHAN AND PETER BORWEIN. One key to the work of both Ramanu- 
jan and Chowla in their derivations of formulas for 1/2 is Clausen’s formula for the 
square of a complete elliptic integral of the first kind or, by (3.3), for the square of 
the hypergeometric function 2F (3, 4; 1; 2). The aforementioned rendition (3.7) of 
Clausen’s formula is not the most general version of Clausen’s formula, namely, 


2F} (a,b;a+b+ 4; z) = 3F; (2a, 2b,a+ bia +b+ 5, 2a + 2b; z). (6.1) 


Indeed, the work of many authors who have proved Ramanujan-like series for 1/xz 
ultimately rests on special cases of (6.1). In particular, squares of certain other hyper- 
geometric functions lead one to Ramanujan’s alternative theories of elliptic functions. 

A second key step is to find another formula for (K /2), which also contains an- 
other term involving 1/7. Combining the two formulas to eliminate the term (K /7)? 
then produces a hypergeometric-type series representation for 1/7. Evidently, unaware 
of Chowla’s earlier work, the Borweins proceeded in a similar fashion and used Leg- 
endre’s relation {23, p. 24] 


E(k) K'(k) + E'(k)K (k) — K(k) K(k) = 5 


and other relations between elliptic integrals to produce such formulas. 

Having derived a series representation for 1/2, one now faces the problem of eval- 
uating the moduli and elliptic integrals that appear in the formulas. If gq = e~*¥", then 
for certain positive integers » one can evaluate the requisite quantities. This leads us 
to the definition of the Ramanujan—Weber class invariants. After Ramanujan, set 


x(q) = (G3 q7)oos lal <1, (6.2) 


where (a; q)oo is defined by (3.9). If n is a positive rational number and g = e7***, 
then the class invariants G,, and g,, are defined by 


G2 Mg yg) and gy = 24g" (—-g). (6.3) 


In the notation of H. Weber [63], G, = 27'/*f(./—n) and gy = 27'/4f,(./—n). As 
mentioned in Section 3, k, := k(e77¥”) is called the singular modulus. In his volumi- 
nous work on modular equations, Ramanujan sets a := k* and B := 2’. Accordingly, 
we set a, := k?. Because (13, p. 124, Entries 12(v), (vi)] 


x(q) = 2'fa(l—a)/gy'™* = and x (-g) = 2! fa — a)? /g} 
it follows from (6.3) that 
G, = {4a,(l—a,)} 4 = and =, = (4a, (1 —a,) 7} "4. (6.4) 


In the form (6.4), the class invariant G, appears on the right-hand sides of (3.7) and 
(3.10), and consequently the values of G,, for several values of n are important in de- 
riving certain series for 1/7. It is known that if n is square-free and n = 1 (mod 4) then 
G* is a real unit that generates the Hilbert class field of the quadratic field Q(./—n) 
[32, Cor. 5.2], and this fact is very useful in evaluating G,. When we say that a series 
for 1/x is associated with the imaginary quadratic field Q(./—n), we mean that the 
constants involved in the series are related to the generators of the Hilbert class field 


of Q(./—n). 


574 © THE MATHEMATICAL ASSOCIATION OF AMERICA [Monthly 116 


19. RAMANUJAN’S SERIES FOR 1/7: A SURVI 


@ 


YY (2009) 


Singular moduli and class invariants are actually algebraic numbers. In general, as 
n increases, the corresponding series for 1/2 converges more rapidly. The series (2.2) 
is associated with the imaginary quadratic field Q(./—163). 

The Borweins’ proofs of al! 17 of Ramanujan’s series for | /2 can be found in their 
book [23]. Their derivations arise from several general hypergeometric-like series rep- 
resentations for 1/7 given in terms of singular moduli, class invariants, and complete 
elliptic integrals [23, pp. 181-184]. Another account of their work, but with fewer de- 
tails, can be found in their paper [24] commemorating the centenary of Ramanujan’s 
birth. Further celebrating the 100th anniversary of Ramanujan’s birth, the Borweins 
derived further series for 1/2 in [25]. The series in this paper correspond to imaginary 
quadratic fields with class number 2, with one of their series corresponding to n = 427 
and yielding about 25 digits of x per term. In [29], the authors derived series for 1/7 
arising from fields with class number 3, with a series corresponding to n = 907 yield- 
ing about 37 or 38 digits of 7 per term. Their record is a series associated with a field 
of class number 4 giving about 50 digits of z per term; here, n = 1555 [28]. The latter 
paper gives the details of what we have written in this paragraph. 

The Borweins have done an excellent job of communicating their work to a wide 
audience. Besides their paper [28], see their paper in this MONTHLY [30], with 
D. H. Bailey, on computing , especially via work of Ramanujan, and their delightful 
paper in the Scientific American [26], which has been reprinted in [22, pp. 187-199] 
and [11, pp. 588-595]. 


7. DAVID AND GREGORY CHUDNOVSKY. In our Introduction we mentioned 
that the Chudnovsky brothers, Gregory and David, used (2.2) to calculate over 2 billion 
digits of z. They had first used (2.2) to calculate 1,130,160,664 digits of 7 in the fall 
of 1989 on a “borrowed” computer. They then built their own computer, “‘m zero,” 
described colorfully in [56], and set a world record of 2,260,321,336 digits of 2. The 
world record for digits of 27 has been broken several times since then, and since it is 
not the purpose of this paper to delineate this computational history, we refrain from 
mentioning further records. 

The Chudnovskys, among others, have extensively examined their calculations for 
patterns. It is a long outstanding conjecture that 2 is normal. In particular, for each k, 
0<k <9, 


__ # of appearances of k in the first N digits of x 1 
lim =—. 
N20 N 10 


The Chudnovskys’ calculations, and all subsequent calculations of Kanada, lend cre- 
dence to this conjecture. As a consequence, the average of the digits over a long in- 
terval should be approximately 4.5. The Chudnovskys found that for the first billion 
digits the average stays a bit on the high side, while for the next billion digits, the 
average hovers a bit on the low side. Their paper [43] gives an interesting statistical 
analysis of the digits up to one billion. For example, the maximal length of a string of 
identical digits, for each of the ten digits, is either 8, 9, or 10. 

The Chudnovskys deduced (2.2) from a general series representation for 1/m, which 
we will describe after making severa] definitions. For t € 7H = {t : Imt > 0} and 
each positive integer k, the Eisenstein series E,(1) is defined by 


4k =. “yj mit 
Ex(t) =1—5- Dlowaa, gg =e™, (7.1) 
2k Fay 
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where B,, k > 0, denotes the kth Bernoulli number and o;,(n) = >- d\n d*. Klein’s ab- 
solute modular J-invariant is defined by 
A) 


IO) = 55) — Ee) 


tTeEH. (7.2) 


It is well known that if a(g) := k*(q), where k is the modulus, then [13, pp. 126-127, 
Entry 13] 


4 (1 —a(g) +.07(q))° 
27Ta2(q)(1 — a(g))? 


Thus, (6.4) and (7.3) show that, when g = ern singular moduli, class invariants, 
and the modular /-invariant are intimately related. Now define 


3 
(zx) * Tr) , 


We are now ready to state the Chudnovskys’ main formula [44, p. 122]. If tr = 
(1+ /—n)/2, then 


ieee eee IO ae 
6S NTT EY GayieP 1724) Oe nl — J) 


(7.4) 


J(2t) = (7.3) 


E4(t) 
E(t) 


$2(T) = 


u=0 


The Chudnovskys’ series (2.2) is the special case n = 163 of (7.4). 

The Chudnovsky brothers developed and extended Ramanujan’s ideas in directions 
different from those of other authors. They obtained hypergeometric-like representa- 
tions for other transcendental constants and proved, for example, that 


T (za) 


are transcendental [42]. Their advances involve the “second” solution of the hyperge- 
ometric differential equation. Recall from the theory of linear differential equations 
that 2 F\ (a, b; c; x) is a solution of a certain second-order linear differential equation 
with a regular singular point at the origin [5, p. 1]. A second linearly independent so- 
lution is generally not analytic at the origin, and in [43] and [44, pp. 124-126], the 
Chudnovsky brothers establish new hypergeometric series identities involving the lat- 
ter function. Their identities lead to hypergeometric-like series representations for 77, 
including Gosper’s formula (5.1). In [45], the authors provide a lengthy list of such 
examples, including 


oO 2 a 
457 + 644 = e 8/ (430 7° — 62407 — 520) 


i= (9) 


The Chudnovsky brothers have also employed series for 1/7: to derive theorems on 
irrationality measures z(a), which are defined by 


Pp 
es 


(ay s=int Ju > 0 :O0< 
q 


] Pee F P 
So has only finitely many solutions — € Q}. 
q q 
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By the famous Thue—Siegel—Roth Theorem [6, p. 66], 


= 1, if a is rational, 
uta) += 2, if a is algebraic but not rational, 
> 2, if w is transcendental. 


Although the Chudnovskys can obtain irrationality measures for various constants, the 
one they obtain for 7 is not as good as one would like. Currently, the world record 
for the irrationality measure of w is held by M. Hata [54], who proved that (7) < 
8.016045 .... Their methods are much better for obtaining irrationality measures for 
expressions, such as 7/+/640320, arising in their series (2.2). See also a paper by 
W. Zudilin [64]. 


8. RAMANUJAN’S CUBIC CLASS INVARIANT AND HIS ALTERNATIVE 
THEORIES. In Sections 3, 4, 6, and 7, we emphasized how Ramanujan—Weber class 
invariants and singular moduli were of central importance for Ramanujan and others 
who followed in deriving series for 1/7. We also stressed in Section 1, in particular, in 
the discourse after (1.5), that Ramanujan’s remarkable idea of replacing the classical 
hypergeometric function 2F\(3, 5; 1; x) by 2Fi(4, =; 1; x),7 = 3, 4, 6, leads to new 
and beautiful alternative theories. On the top of page 212 in his lost notebook [60}, 
Ramanujan defines a cubic class invariant A,, (i.e., 7 = 3 in (1.5)), which is an ana- 
logue of the Ramanujan—Weber classical invariants G, and g, defined in (6.3). Define 
Ramanujan’s function 


f(-@) = Doo lgl <4, (8.1) 
where (a; G7). is defined in (3.9), and the Dedekind eta-function y(t) 


n(t) = elrit/a4 TIa o eemity =n qi! f(—@), (8.2) 
j=l 


where g = ¢*7* and Im t > 0. Then Ramanujan’s cubic class invariant A, is defined 
by 


(eee) 
ot a 2 ; 83) 
373 JG £'(q?) 33 (45) 
3 


where q = e774", ie, t= hi/n/3. 

Chan, Liaw, and Tan [34] established a general series representation for 1/z in 
terms of i,, that is analogous to the general formulas of the Borweins and Chudnovskys 
in terms of the classical class invariants. To state this general formula, we first need 
some definitions. Define 

1 Te of? tQ) 


a 8.4 
a*(q) 27q f'(q*) os 


Thus, when g = e-*¥"? and @* := a*(e-7¥"), (8.3) and (8.4) imply that 


1 
—=1-22. 
a* 


a 
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In analogy with (3.6), define the multiplier m(g) by 
* * art 
m(q) := m(a*, B*) := 


where 8* = a*(q"). We are now ready to state the general representation of {/m de- 
rived by Chan, Liaw, and Tan [34, p. 102, Theorem 4.2]. 


Theorem 8.1. Forn > 1, let 


ax(1 — oF) dm(a*, £*) 
Jn da* at=l-at Bt=a2 
b, = 1 — 2a%, 


ay = — 


and 


Then 
Le De + b,j) 2s), Bh, (0), Hi. (8.5) 


We give one example. Let n = 9; then aj = 2. Then, without providing further 
details, 


= 2 jG) (3), G), He), 


Gly 16 


which was discovered by Chan, Liaw, and Tan [34, p. 95]. 

Another general series representation for 1/7 in the alternative theories of Ramanu- 
jan was devised by Berndt and Chan (19, p. 88, Eq. (5.80)]. We will not state this for- 
mula and all the requisite definitions, but let it suffice to say that the formula involves 
Ramanujan’s Eisenstein series P(g), O(q) = E4(t), and R(q) = E¢(t) at the argu- 
ment g = —e~"¥* and the modular j-invariant, defined by j(t) = 1728/ (x), where 
J (t) is defined by (7.2). In particular, 

.(3t+v¥—3n Qe = OA 1? 
2 2 , 


I = —27 


a proof of which can be found in [18]. The hypergeometric terms are of the form 
1) (1) (3 
eae (é), (a), 
cL) 


Berndt and Chan used their general formula to calculate a series for 1/7 that yields 
about 73 or 74 digits of x per term. 
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Lastly, we conclude this section by remarking that Berndt, Chan, and Liaw [20] 
have derived series representations for | /z that fall under the umbrella of Ramanujan’s 
quartic theory of elliptic functions. Because the quartic theory is intimately connected 
with the classical theory, their general formulas [20, p. 144, Theorem 4.1] involve the 
classical invariants G,, and g, in their summands. Not surprisingly, the hypergeometric 
terms are of the form 


OR 


The simplest example arising from their theory is given by [20, p. 145] 


9. THE PRESENT AUTHORS AS DISCIPLES OF RAMANUJAN. As men- 
tioned in Section 2, the first two authors were inspired by the third author to continue 
the development of Ramanujan’s thoughts. In their first paper [9], Baruah and Berndt 
employed Ramanujan’s ideas in the classical theory of elliptic functions to prove 13 
of Ramanujan’s original formulas and many new ones as well. In [7], they utilized 
Ramanujan’s cubic and quartic theories to establish five of Ramanujan’s 17 formu- 
las in addition to some new representations. Lastly, in [8], motivated by the work 
of J. Guillera, described briefly in Section 10 below, the first two authors extended 
Ramanujan’s ideas to derive hypergeometric-like series representations for 1/77. For 
example, 


IA oo 1 pwtl 
— = - (44571654400, + 5588768408. + 233588841)B,, (sx) 
ca = 99 


where 


3 3 
= vi Ga —v)! 


In Section 3, we defined Ramanujan’s Eisenstein series P(g) in (3.12) and offered 
several definitions from Ramanujan’s theories of elliptic functions in giving a brief 
introduction to Ramanujan’s ideas. Here we highlight the role of P(g) in more detail 
before giving a complete proof of (1.3). Because these three series representations 
(1.2)+(1.4) can also be found in Ramanujan’s lost notebook, our proof here is similar 
to that given in [3, Chapter 15]. 

Following Ramanujan, set 


z= 2F, (3,4: 15x). (9.1) 


The two most important ingredients in our derivations are Ramanujan’s representation 
for P(g?) given by [13, p. 120, Entry 9(iv)] 


P(q’) = (i — 2x)z* + 6x(1 -— or (9.2) 
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and Clausen’s formula (3.7), which, using (9.1), we restate in the form 


where, as in (1.1), 
Aji= 24+ and SX = 4x(1—x). 
From (9.3) and (9.4), 
ried = > A,jX?-'.4q. — 2x). 
dx im d 
Hence, from (9.2), (9.3), (9.5), and (9.4), 


P(q?) = (1— 2x) 9) A,X? + 3(1 ~ 2x) 0 A, JX! 


j=0 j=0 


= S\{(1 — 2x) + 3(1 — 2x) JA; X?. 


j=0 
For q := e77¥", recall (9.1) and set 
Xp = Re), ty =F (4,45 1 xn) 
and 
X, = 4x, — xX). 
For later use, we note that [3, p. 375] 
1— xX, = Xtjn and Liga = ina 


With the use of (9.7) and (9.8), (9.6) takes the form 


P(e2*v*) = Yita —2x,) +30 - 2xn) fJA;X? 
7=0 


= el - je +3 xe c= 2x,)fA;X). 


j=0 


(9.3) 


(9.4) 


(9.5) 


(9.6) 


(9.7) 


(9.8) 


(9.9) 


(9.10) 


In order to utilize (9.10), we require two different formulas, each involving both 
P(q*) and P(q*"), where n is a positive integer. The first comes from a transformation 
formula for P(g), which in turn arises from the transformation formula for f(—g) 
defined in (8.1) or the Dedekind eta function defined in (8.2), and is for general n. This 


transformation formula is given by [13, p. 43, Entry 27(iit)] 


-a/12,,1/4 2a -B/12 gi/4 -2 
eg te My Sete ite): 


(9.11) 
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where af = 77, with a and f both positive. Taking the logarithm of both sides of 
(9.11), we find that 


pill enti) — Beg. en 
-5 f= Flog + oat ~e joy at log 6 re Yost - iby. 


(9.12) 


Differentiating both sides of (9.12) with respect to w, we deduce that 


Pool BS 2jer*® BL A jBfae#? 
“Daa 21-2 aaa Ay nee) 


Multiplying both sides of (9.13) by 12a and rearranging, we arrive at 


© 5 4-2ja -2j 
je 
cna (1-m ) a) a(1-a3 age OE (9.14) 
= 


Setting a = 2 /,/n, so that 8 = 2./n, recalling the definition (3.12) of P(g), and 
rearranging slightly, we see that (9.14) takes the shape 


6 
ee P(eW*/V") 4 n P(e), (9.15) 
a 
This is the first desired formula. 
The second gives representations for Ramanujan’s function [57], [58, pp. 33-34] 


fn(q) =n P(g") — P(q’) (9.16) 


for certain positive integers n. (Ramanujan [57], [58, pp. 33-34] used the notation 
f(n) instead of f,(q).) In [57], Ramanujan recorded representations for f,(q) for 12 
values of n, but he gave no indication of how these might be proved. These formulas 
are also recorded in Chapter 21 of Ramanujan’s second notebook [59], and proofs may 
be found in [13]. 

We now give the details for our proof of (1.3), which was clearly a favorite of 
Gabriella Montez, the precocious student in High School Musical. Unfortunately, we 
do not know whether she possessed a proof of her own. We restate (1.3) here for 
convenience. 


Theorem 9.1. if A;, j > 0, is defined by (9.4), then 


j a O17) 


ox 
= (42) + 5)A; 
j=0 
Proof. The identity (9.17) is connected with modular equations of degree 7. Thus, our 
first task is to calculate the singular modulus x7. To that end, we begin with a modular 
equation of degree 7 


178 __ 


{x(gox(q7)}"" + {. — xq) — xq’))} (9.18) 
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due to C. Guetzlaff in 1834 but rediscovered by Ramanujan in Entry 19(i) of Chapter 
19 of his second notebook [59], [13, p. 314]. In the notation of our definition of a 
modular equation after (3.5), we have set x(q) = k7(q), and so x(q’) = £7(q). Set 
g= e/V7 in (9.18) and use (9.9) and (9.8) to deduce that 


{x(t — x7)? = 1 and X7= xe 


Ramanujan calculated the singular modulus x; in his first notebook [59], [15, p. 290], 
from which, or from (9.19), we easily can deduce that 


fesse ae (9.20) 


(9.19) 


In the notation (9.16), from either [57], {58, p. 33], or [13, p. 468, Entry 5(iii)], 


filg) = 32(q)z(q") ¢ + V/x(g)x(q7) + V0 — xq) — x(q"))) : (9.21) 


Putting g = e7*/ v7 in (9.21) and employing (9.9) and (9.19), we find that 


filet?) =3V7 (1 todetl= x) 2=3V7. ad. (9.22) 
Letting n = 7 in (9.15) and (9.10), and using (9.20), we see that 


ou P(e!) +. 7 P(e-*~7) (9.23) 


and 


P(eW™Y7) = (1 — 2ey)z5 +3 DU — 2x1) jAjX4 
j=0 


moby 
eens ole: ar (9.24) 


respectively. Eliminating P(e~?*/ v7) from (9.22) and (9.23) and putting the resulting 
formula for P(e~2*~") in (9.24), we find that 


37 4, 27 21V7 2 V7 2 ys aoe 
Ix  16+7 in ae 


which upon simplification with the use of (9.3) yields (9.17). | 


10. JESUS GUILLERA. A discrete function A(n, k) is hypergeometric if 


AM+L A A(tn,k +1) 
A(n, k) A(n, k) 


are both rational functions. A pair of functions F (n, k) and G(n, k) is said to be a WZ 
pair (after H. S. Wilf and D. Zeilberger ) if F and G are hypergeometric and 


Fin +1,k) — F(n,k) = G(n,k + 1) — Gm, k). 
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In this case, H. Wilf and D. Zeilberger [55] showed that there exists a rational function 
C(n, k) such that 


G(n,k) = C(n, k) F(n,k). 
The function C(m, &) is called a certificate of (F, G). Defining 
Hin, k) = Fint+k,n+1)4+ Gin,n+hk), 


Wilf and Zeilberger showed that 


3 H(n,0) = 3 G(n, 0). 
n=0 n=0 


Ekhad (Zeilberger’s computer) and Zeilberger [46] were the first to use this method to 
derive a one-page proof of the representation 


2 4, (3), 
ee a (4j SDs. (10.1) 
j= 


The identity (10.1) was first proved by G. Bauer in 1859 [10]. Ramanujan recorded 
(10.1) as Example 14 in Section 7 of Chapter 10 in his second notebook [59], [12, 
pp. 23-24]. Further references can be found in [9]. In 1905, generalizing Bauer’s ap- 
proach, J. W. L. Glaisher [47] found further series for 1/7. 

Motivated by this work, Guillera [48] found many new WZ-pairs (F, G) and de- 
rived new series not only for 1/z but for 1/7? as well. One of his most elegant formu- 
las is 


128 <= °(820j? + 180; + 13) 
pee ae yepi(? ge 


Subsequently, Guillera empirically discovered many series of the type 
A GQ BP+DI+E 
ae aes ea 


Most of the series he discovered cannot be proved by the WZ-method, it appears that 
the WZ-method is only applicable to those series for 1/2 when H is a power of 2. An 
example of Guillera’s series which remains to be proved is [49] 


ss (3 1 2 1 5 
— SEG yy sf) G2) fo) Ma) iu aU 8) sa18 7 + 693) + 29). 


For further Ramanujan-like series for 1/27, see Zudilin’s papers [65], [66]. 
11. RECENT DEVELOPMENTS. We have emphasized in this paper that Clausen’s 


formula (6.1) is an essential ingredient in most proofs of Ramanujan-type series repre- 
sentations for 1/77. However, there are other kinds of series for 1/z that do not depend 
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upon Clausen’s formula. One such series discovered by Takeshi Sato [62] is given by 


tte LECT) Cae) 


je 


(11.1) 


In unpublished work (personal communication to the third author), Sato derived a 
more complicated series for 1/7 that yields approximately 97 digits of 7 per term. A 
companion to (11.1), which was derived by a new method devised by the third author, 
S. H. Chan, and Z.-G. Liu [31], is given by 


8 “Sut 
ya aan? (11.2) 
od 


_< Ww — 2v\ (2v\ fu? 
ECT os 


We cite three further new series arising from this new method. The first is another 
companion of (11.2), which arises from recent work of Chan and H. Verrill [36] (after 
the work of Almkvist and Zudilin [2]), and is given by 


cal 
9 a he 2—VQu—3v uM aty (3v)! 1 g 
ia Beg OT ene (a) 


u=0 v=0 


where 


The second is from a paper by Chan and K. P. Loo [35] and takes the form 
273(3 +272) = 2 3 -\" 
acid Sealine eo 1-=72)(-14+=V2) , 
d, B (« + v2) ( + 4 ) 
where 
“ v v 3 wv Lav 3 
eee) ae) 
v=o | j=0 i=0 


The third was derived by Y. Yang (personal communication) and takes the shape 
= 3 3 (ey 4u +1 
nJ/i5s 4 v} 36" © 


Motivated by his work with Mahler measures and new transformation formulas for 
5 F4 series, M. D. Rogers [61, Corollary 3.2] has also discovered series for 1/7 in the 
spirit of the formulas above. For example, if a,, is defined by (11.3), then 


2 7 3 +1 
== (-1)" 
a 2 a 32" 


This series was also independently discovered by Chan and Verrill [36]. 
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According to W. Zudilin [67], G. Gourevich empirically discovered a hyper- 
geometric-like series for 1/2, namely, 


7 
ao, et ts) 
a » e680! +767 + 144 + 1). 
pao 


This series and the search for further series representations for 1/2”, m > 2, are de- 
scribed in a paper by D. H. Bailey and J. M. Borwein [4]. 


12. CONCLUSION. One test of “good” mathematics is that it should generate more 
“good” mathematics. Readers have undoubtedly concluded that Ramanujan’s original 
series for 1/2 have sown the seeds for an abundant crop of “good” mathematics. 
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20. The computation of previously inaccessible 
digits of 7? and Catalan’s constant (2013) 


Paper 20: David H. Bailey, Jonathan M. Borwein, Andrew Mattingly and Glenn 
Wightwick, “The computation of previously inaccessible digits of 7? and Catalan’s 
constant,” Notices of the American Mathematical Society, vol. 60 (2013), p. 844— 
854. Reprinted by permission of the American Mathematical Society. 


Synopsis: 

An earlier selection (paper #14 in this collection) presented what is now known 
as the BBP formula for 7, which permits one to calculate binary or base-16 digits 
of 7 beginning at an arbitrary starting point. The original BBP paper presented 
a similar formula for 7?, permitting arbitrary binary digits of 1? to be calculated 
by this same general process. Since the publication of that paper, additional BBP- 
type formulas have also been found, among them one that permits arbitrary base-3 
digits of 7? to be calculated, and another that permits arbitrary binary digits of 
Catalan’s constant = >? _)(—1)"/(2n + 1)? = 0.9159965594... to be calculated. 

This paper outlines the history of computing a and other constants through 
the ages, and then gives details on three new computations: base-64 digits of 7, 
base-729 digits of 7? and base-4096 digits of Catalan’s constant, in each case be- 
ginning with position ten trillion. These computations, which required a total of 
approximately 1.5 x 10!9 floating-point arithmetic operations, and which ran for 
tens of “rack-days” on an IBM Blue-Gene computer, are comparable in total cost, 
say, to that of generating a state-of-the-art animated movie. 


Keywords: Computation, History, Normality, General Audience 
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The Computation of 
Previously Inaccessible 
Digits of mr and 
Catalan’s Constant 


David H. Bailey, Jonathan M. Borwein, Andrew Mattingly, 
and Glenn Wightwick 


Introduction 

We recently concluded a very large mathematical 
calculation, uncovering objects that until recently 
were widely considered to be forever inaccessible 
to computation. Our computations stem from the 
“BBP” formula for 7t, which was discovered in 
1997 using a computer program implementing the 
“PSLQ” integer relation algorithm. This formula 
has the remarkable property that it permits one 
to directly calculate binary digits of 7, beginning 
at an arbitrary position d, without needing to 
calculate any of the first d — 1 digits. Since 1997 
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numerous other BBP-type formulas have been 
discovered for various mathematical constants, 
including formulas for 7? (both in binary and 
ternary bases) and for Catalan’s constant. 

In this article we describe the computation 
of base-64 digits of 1r*, base-729 digits of 1, 
and base-4096 digits of Catalan’s constant, in 
each case beginning at the ten trillionth place, 
computations that involved a total of approximately 
1.549 x 10!° floating-point operations. We also 
discuss connections between BBP-type formulas 
and the age-old unsolved questions of whether and 
why constants such as mr, Tr’, log 2, and Catalan’s 
constant have “random” digits. 


Historical Background 

Since the dawn of civilization, mathematicians 
have been intrigued by the digits of mr [6], more 
so than any other mathematical constant. In the 
third century BCE, Archimedes employed a brilliant 
scheme of inscribed and circumscribed 3 - 2”-gons 
to compute tm to two decimal digit accuracy. 
However, this and other numerical calculations of 
antiquity were severely hobbled by their reliance 
on primitive arithmetic systems. 

One of the most significant scientific devel- 
opments of history was the discovery of full 
positional decimal arithmetic with zero by an 
unknown mathematician or mathematicians in 
India at least by 500 CE and probably earlier. Some 
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of the earliest documentation includes the Aryab- 
hatiya, the writings of the Indian mathematician 
Aryabhata dated to 499 CE; the Lokavibhaga, a 
cosmological work with astronomical observations 
that permit modern scholars to conclude that it was 
written on 25 August 458 CE [9]; and the Bakhshali 
manuscript, an ancient mathematical treatise that 
some scholars believe may be older still, but in 
any event is no later than the seventh century [7], 
[8], [2]. The Bakhshali manuscript includes, among 
other things, the following intriguing algorithm 
for computing the square root of q, starting with 
an approximation Xo: 


2 
= q—Xn 
an => 
2Xn 

2 
Gn 


2 (Xn + Gn)” 


(1) Xn+1 = Xn + Gn 


This scheme is quartically convergent in that it 
approximately quadruples the number of correct 
digits with each iteration (although it was never 
iterated more than once in the examples given in 
the manuscript) [2]. 

In the tenth century, Gerbert of Aurillac, who 
later reigned as Pope Sylvester II, attempted 
to introduce decimal arithmetic in Europe, but 
little headway was made until the publication of 
Fibonacci’s Liber Abaci in 1202. Several hundred 
more years would pass before the system finally 
gained universal, if belated, adoption in the West. 
The time of Sylvester’s reign was a very turbulent 
one, and he died in 1003, shortly after the death of 
his protector, Emperor Otto IIL. It is interesting to 
speculate how history would have changed had he 
lived longer. A page from his mathematical treatise 
De Geometria is shown in Figure 1. 


The Age of Newton 


Armed with decimal arithmetic and spurred by the 
newly discovered methods of calculus, mathemati- 
cians computed with aplomb. Again, the numerical 
value of 7 was a favorite target. Isaac Newton 
devised an arcsine-like scheme to compute digits of 
tr and recorded 15 digits, although he sheepishly 
acknowledged, “I am ashamed to tell you to how 
many figures I carried these computations, having 
no other business at the time.” Newton wrote 
these words during the plague year 1666, when, 
ensconced in a country estate, he devised the 
fundamentals of calculus and the laws of motion 
and gravitation. 

All large computations of 7r until 1980 relied 
on variations of Machin’s formula: 


TT 1 1 
(2) — = 4arctan (=) —arctan(55,). 


The culmination of these feats was a computation of 
Tr using (2) to 527 digits in 1853 by William Shanks, 
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Figure 1. Excerpt from De Geometria by Pope 
Sylvester II (reigned 999-1003 CE). 


later (erroneously) extended to 707 digits. In the 
preface to the publication of this computation, 
Shanks wrote that his work “would add little or 
nothing to his fame as a Mathematician, though 
it might as a Computer” (until 1950 the word 
“computer” was used for a person, and the word 
“calculator” was used for a machine). 

One motivation for such computations was to 
see whether the digits of 7 repeat, thus disclosing 
the fact that tr is a ratio of two integers. This 
was settled in 1761, when Lambert proved that 
tr is irrational, thus establishing that the digits 
of 7m do not repeat in any number base. In 1882 
Lindemann established that 7r is transcendental, 
thus establishing that the digits of 7r? or any 
integer polynomial of 7r cannot repeat, and also 
settling once and for all the ancient Greek question 
of whether the circle could be squared—it cannot, 
because all numbers that can be formed by 
finite straightedge-and-compass constructions are 
necessarily algebraic. 


The Computer Age 


At the dawn of the computer age, John von 
Neumann suggested computing digits of prominent 
mathematical constants, including 7 and e, for 
statistical analysis. At his instigation, ™ was 
computed to 2,037 digits in 1949 on the Electronic 
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of American History, Smithsonian Institution. 
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Figure 2. The ENIAC in the Smithsonian’s 
National Museum of American History. 


Numerical Integrator and Calculator (ENIAC); see 
Figure 2. In 1965 mathematicians realized that 
the newly discovered fast Fourier transform could 
be used to dramatically accelerate high-precision 
multiplication, thus facilitating not only large 
calculations of 7 and other mathematical constants 
but research in computational number theory as 
well. 

In 1976 Eugene Salamin and Richard Brent 
independently discovered new algorithms for 
computing the elementary exponential and trigono- 
metric functions (and thus constants such as 7 
and e) much more rapidly than by using classical 
series expansions. Their schemes, based on elliptic 
integrals and the Gauss arithmetic-geometric mean 
iteration, approximately double the number of 
correct digits in the result with each iteration. 
Armed with such techniques, 7T was computed to 
over one million digits in 1973, to over one billion 
digits in 1989, to over one trillion digits in 2002, 
and to over five trillion digits at the present time; 
see Table 1. 

Similarly, the constants e, ¢, V2, log 2, log 10, 
¢ (3), Catalan’s constant G = }?_9(-1)"/(2n+1)?, 
and Euler’s y constant have now been computed 
to impressive numbers of digits; see Table 2 [10]. 

One of the most intriguing aspects of this 
historical chronicle is the repeated assurances, 
often voiced by highly knowledgeable people, that 
future progress would be limited. As recently 
as 1963, Daniel Shanks, who himself calculated 
tT to over 100,000 digits, told Philip Davis that 
computing one billion digits would be “forever 
impossible.” Yet this feat was achieved less than 
thirty years later in 1989 by Yasumasa Kanada 
of Japan. Also in 1989, famous British physicist 
Roger Penrose, in the first edition of his best- 
selling book The Emperor’s New Mind, declared 
that humankind likely will never know if a string of 
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Table 1. Modern Computer-Era 7r Calculations. 


Name Year Correct Digits 
Miyoshi and Kanada 1981 2,000,036 
Kanada-Yoshino-Tamura 1982 16,777,206 
Gosper 1985 17,526,200 
Bailey Jan. 1986 29,360,111 
Kanada and Tamura Sep. 1986 33,554,414 
Kanada and Tamura Oct. 1986 67,108,839 
Kanada et. al Jan. 1987 134,217,700 
Kanada and Tamura Jan. 1988 201,326,551 
Chudnovskys May 1989 480,000,000 
Kanada and Tamura Jul. 1989 536,870,898 
Kanada and Tamura Nov. 1989 1,073,741,799 
Chudnovskys Aug. 1991 2,260,000,000 
Chudnovskys May 1994 4,044,000,000 
Kanada and Takahashi Oct. 1995 6,442,450,938 
Kanada and Takahashi Jul. 1997 51,539,600,000 
Kanada and Takahashi Sep. 1999 206,158,430,000 
Kanada-Ushiro-Kuroda Dec. 2002 | 1,241,100,000,000 
Takahashi Jan. 2009 | 1,649,000,000,000 
Takahashi Apr. 2009 | 2,576,980,377,524 
Bellard Dec. 2009 | 2,699,999,990,000 
Kondo and Yee Aug. 2010 | 5,000,000,000,000 


Table 2. Computations of Other Mathematical 


Constants. 

Constant Decimal digits | Researcher Date 
V2 1,000,000,000,000 | S. Kondo 2010 

ro) 1,000,000,000,000 | A. Yee 2010 

e 500,000,000,000 | S. Kondo 2010 
log 2 100,000,000,000 | S. Kondo 2011 
log 10 100,000,000,000 | S. Kondo 2011 
c(3) 100,000,001,000 | A. Yee 2011 
G 31,026,000,000 | A. Yee and R. Chan | 2009 

y 29,844,489,545 | A. Yee 2010 


ten consecutive 7s occurs in the decimal expansion 
of tr. This string was found just eight years later, 
in 1997, also by Kanada, beginning at position 
22,869,046,249. After being advised of this fact 
by one of the present authors, Penrose revised his 
second edition to specify twenty consecutive 7s. 

Along this line, Brouwer and Heyting, exponents 
of the “intuitionist” school of mathematical logic, 
proposed, as a premier example of a hypothesis 
that could never be formally settled, the question 
of whether the string “0123456789” appears in the 
decimal expansion of 7. Kanada found this at the 
17,387,594,880-th position after the decimal point. 
Even astronomer Carl Sagan, whose lead character 
in his 1985 novel Contact (played by Jodi Foster in 
the movie version) sought confirmation in base-11 
digits of m, expressed surprise to learn, shortly 
after the book’s publication, that m had already 
been computed to many millions of digits. 


The BBP Formula for pi 

A 1997 paper [3], [5, Ch. 3] by one of the present 
authors (Bailey), Peter Borwein and Simon Plouffe 
presented the following unknown formula for Tr, 
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now known as the “BBP” formula for Tr: 


(3) 


ae : "( 4 2 1 1 ) 
Kp 16K\ 8k +1 8kK+4 8kK+5  8k+6)" 


This formula has the remarkable property that 
it permits one to directly calculate binary or 
hexadecimal digits of m beginning at an arbitrary 
starting position without needing to calculate 
any of the preceding digits. The resulting simple 
algorithm requires only minimal memory, does not 
require multiple-precision arithmetic, and is very 
well suited to highly parallel computation. The 
cost of this scheme increases only slightly faster 
than the index of the starting position. 

The proof of this formula is surprisingly 
elementary. First note that for any k < 8, 


V/V2 yk-1 WE Se a 

— dx =| xeon ret dy 
j, 1—x8 0 p> 

(4) ob 


1 1 
© 2k/2 2 16'(8i +k)’ 


Thus one can write 


. 1 ( 4 2 1 1 ) 
16° \8i+1 8i+4 8i1+5 8i+6 


(5) i=0 2 _ _ 
[- 4/2 — 8x3 — 4V/2x4 — 8x° 
dx, 
0 1— x8 


which on substituting y := /2x becomes 
(6) : 
16y—- 16 
eh Sc | 
J, yi-2y3+4y-4 d 
I. 4y 1 4y-8 

Ie 22 Ne FS BHA 
reflecting a partial fraction decomposition of the 
integral on the left-hand side. In 1997 neither Maple 
nor Mathematica could evaluate (3) symbolically 
to produce the result 7r. Today both systems can 
do this easily. 


dy = 7, 


Binary Digits of log 2 

It is worth noting that the BBP formula (3) was not 
discovered by a conventional analytic derivation. 
Instead, it was discovered via a computer-based 
search using the PSLO integer relation detection 
algorithm (see the section “Hunt for a pi Formula”) 
of mathematician-sculptor Helaman Ferguson [4] 
in a process that some have described as an 
exercise in “reverse mathematical engineering”. 
The motivation for this search was the earlier 
observation by the authors of [3] that log 2 also has 
this arbitrary position digit calculating property. 
This can be seen by analyzing the classic formula 


(7) log2 = Y—, 
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which has been known at least since the time of 
Euler and which is closely related to the functional 
equation for the dilogarithm. 

Let r mod 1 denote the fractional part of a non- 
negative real number r, and let d be a nonnegative 
integer. Then the binary fraction of log 2 after the 
“decimal” point has been shifted to the right d 
places can be written as 
(8) 

(24 log 2) mod 1 


d 9d-k © 9d-k 
= > yo mod 1+ y ——  mod1]mod1 


k=1 k=d+1 
d 5d-k 

7 (= 2 mod k od 
k=1 


© 9d-k 
+ > ae mod : mod 1, 
k=d+1 
where “mod k” has been inserted in the numerator 
of the first term since we are only interested in the 
fractional part of the result after division. 

The operation 24-* mod k can be performed 
very rapidly by means of the binary algorithm 
for exponentiation. This scheme is the simple 
observation that an exponentiation operation such 
as 3!” can be performed in only five multiplications 
instead of 16 by writing it as 3!” = ((((32)2)?)2) +3. 
Additional savings can be realized by reducing all 
of the intermediate multiplication results modulo 
k at each step. This algorithm, together with the 
division and summation operations indicated in the 
first term, can be performed in ordinary double- 
precision floating-point arithmetic or for very 
large calculations by using quad- or oct-precision 
arithmetic. 

Expressing the final fractional value in binary 
notation yields a string of digits corresponding to 
the binary digits of log 2 beginning immediately 
after the first d digits of log 2. Computed results 
can be easily checked by performing this operation 
for two slightly different positions, say d — 1 and 
d, then checking to see that resulting digit strings 
properly overlap. 


Hunt for a pi Formula 


In the wake of finding the above scheme for the 
binary digits of log 2, the authors of [3] immediately 
wondered if there was a similar formula for 1 
(none was known at the time). Their approach was 
to collect a list of mathematical constants (c;) for 
which formulas similar in structure to the formula 
for log 2 were known in the literature and then to 
determine by means of the PSLQ integer relation 
algorithm if there exists a nontrivial linear relation 
of the form 


(9) Got + A, Q) + A2Q2+++++AnXn = 0, 
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where a; are integers (because such a relation 
could then be solved for 7r to yield the desired 
formula). After several months of false starts, the 
following relation was discovered: 

(10) 


1,3 
m= aah a 
4 


where the first term is a Gauss hypergeometric 
function evaluation. After writing this formula 
explicitly in terms of summations, the BBP formula 
for TT was uncovered: 

(11) 


1 1 
-+) + 2arctan (5) —log5, 


7 s + 4 2 1 1 ) 
imp LOEN8K +1 BkK+4  BK+5  BK+6)" 


One question that immediately arose in the 
wake of the discovery of the BBP formula for 7 
was whether there are formulas of this type for 1 
in other number bases—in other words, formulas 
where the 16 in the BBP formula is replaced by some 
other integer, such as 3 or 10. These computer 
searches were largely laid to rest in 2004, when one 
of the present authors (Jonathan Borwein), together 
with Will Galway and David Borwein, showed that 
there are no degree-1 BBP-type formulas of Machin- 
type for Tr, except those whose base is a power of 
two [5, pp. 131-133]. 


The BBP Formula in Action 


Variants of the BBP formula have been used 
in numerous computations of high-index digits 
of mr. In 1998 Colin Percival, then a 17-year- 
old undergraduate at Simon Fraser University 
in Canada, computed binary digits beginning at 
position one quadrillion (10!>). At the time, this 
was one of the largest, if not the largest, distributed 
computations ever done. More recently, in July 
2010, Tsz-Wo Sze of Yahoo! Cloud Computing, in 
roughly 500 CPU-years of computing on Apache 
Hadoop clusters, found that the base-16 digits of 
m beginning at position 5 x 10!4 (corresponding 
to binary position two quadrillion) are 

0 E6C1294A ED40403F 56D2D764 026265BC 

A98511D0 FCFFAA10 F4D28B1B B5392B8 
In an even more recent 2013 computation along 
this line, Ed Karrels of Santa Clara University used a 
system with NVIDIA graphics cards to compute 26 
base-16 digits beginning at position one quadrillion. 
His result: 8353CB3F7FOC9ACCFAQAA215F2. 

The BBP formulas have also been used to 
confirm other computations of 7r. For example, in 
August 2010, Shigeru Kondo (a hardware engineer) 
and Alexander Yee (an undergraduate software 
engineer) computed five trillion decimal digits of 
Tm ona home-built $18,000 machine. They found 
that the last thirty digits leading up to position 
five trillion are 
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(B) Alex Yee and his elephant. 


7497120374 4023826421 9484283852 


Kondo and Yee (see Figure 3) used the following 
Chudnovsky-Ramanujan series: 
(12) 
Oe ae i 
1 -12 s (—1)* (6k)!(13591409 + 545140134k) 
k=0 


T (3k)! (k!)3 6403203k+3/2 


They did not merely evaluate this formula as 
written but instead employed a clever quasi- 
symbolic scheme that mostly avoids the need for 
full-precision arithmetic. 

Kondo and Yee first computed their result in 
hexadecimal (base-16) digits. Then, in a crucial 
verification step, they checked hex digits near the 
end against the same string of digits computed 
using the BBP formula for 7r. When this test passed, 
they converted their entire result to decimal. The 
entire computation took ninety days, including 
sixty-four hours for the BBP confirmation and eight 
days for base conversion to decimal. Note that the 
much lower time for the BBP confirmation, relative 
to the other two parts, greatly reduced the overall 
computational cost. A description of their work is 
available at [11]. 
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BBP-Type Formulas for Other Constants 
In the years since 1997, computer searches 
using the PSLO algorithm, as well as conven- 
tional analytic investigations, have uncovered 
BBP-type formulas for numerous other math- 
ematical constants, including 1’, log’ 2, log 2, 
C(3), 13, log? 2, 1? log 2,774,€(5) and Catalan’s 
constant. BBP formulas are also known for many 
arctangents, as well as for logk, 2 < k < 22, al- 
though none is known for log 23. These formulas 
and many others, together with references, are 
given in an online compendium [1]. 

One particularly intriguing fact is that, whereas 
only binary formulas exist for 7r, there are both 
binary and ternary (base-3) formulas for 7r?: 


a as -( 16 24 8 
8 & 64k\ (6k +1)2 (6K +2)2 (6K + 3)? 


6 i. 1 ) 
(6k+4)2 (6k +5)2/’ 


(14) 
a 2 3 1 ( 243 405 
27 & 729k \(12k +1)? (12k + 2) 
81 27 72 
(12k+4)2 (12k+5)2. (12k + 6)2 
9 9 5 
(12k+7)2 (12k+8)2 (12k + 10)2 


1 
* 2k+ TT 
Formula (13) appeared in [3], while formula (14) 
is due to Broadhurst. There are known binary BBP 
formulas for both ¢(3) and 7r?, but no one has 
found a ternary formula for either. 


Catalan’s Constant 


One other mathematical constant of central interest 
is Eugéne Charles Catalan’s (1814-1894) constant, 
(15) 

—  (-1)" = 

G= ¥ ——,, =0.91596559417722..., 
imp (2n + 1)4 

which is arguably the most basic constant whose 
irrationality and transcendence (though strongly 
suspected) remain unproven. Note the close 
connection to this formula for 1r?: 
(16) 


7 1 
- 2. (n+ 12 = 1-2337005501362.... 
Formulas (15) and (16) can be viewed as the simplest 
Dirichlet L-series values at 2. Such considerations 
were behind our decision to focus the computation 
described in this paper on these two constants. 
Catalan’s constant has already been the sub- 
ject of some large computations. As mentioned 
above, in 2009 Alexander Yee and Raymond Chan 
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calculated G to 31.026 billion digits [10]. This 
computation employed two formulas, including 
this formula due to Ramanujan: 

3 = 1 
8 ro (27) (n+ 1)2 
which can be derived from the fact that 


G = -T(m/4) = -3/2+ T(11/12), 


(17) G + z log(2 + V3), 


where T(@) := ty log tano do. 

The BBP compendium lists two BBP-type formu- 
las for G. The first was discovered numerically by 
Bailey, but both it and the second formula were 
subsequently proven by Kunle Adegoke, based in 
part on some results of Broadhurst. 

For the present study, we sought a formula for G 
with as few terms as possible, because the run time 
for computing with a BBP-type formula increases 
roughly linearly with the number of nonzero 
coefficients. The two formulas in the compendium 
have twenty-two and eighteen nonzero coefficients, 
respectively. So we explored, by means of a 
computation involving the PSLO algorithm, the 
linear space of formulas for G spanned by these 
two sets of coefficients, together with two known 
“zero relations” (BBP-type formulas whose sum 
is zero). These analyses and computations led 
to the following formula, which has only sixteen 
nonzero coefficients and which we believe to be the 
most economical BBP-type formula for computing 
Catalan’s constant: 


(18) 
yes i} s 1 ( 36864 30720 
4096 & 4096k \(24k + 2)2 (24k +392 
30720 6144 1536 
(24k+4)2 (24k +6)2 (24k 47) 
2304 2304 768 
* (24k 49)2 > (24k4+10)2 > (24k + 14)2 
480, 384 1536 
(24k+15)2 (24k+11)2 (24k 412)2 
24 120 36 
* (@4k+19)2 (24k +20)2 (24k 4 21)2 
___ 48 6 ) 
(24k + 22)2 (24k +23)2)° 


BBP Formulas and Normality 

One prime motivation in computing and analyzing 
digits of m and other well-known mathematical 
constants through the ages is to explore the age-old 
question of whether and why these digits appear 
“random”. Numerous computer-based statistical 
checks of the digits of m—unlike those of e—so 
far have failed to disclose any deviation from 
reasonable statistical norms. See, for instance, 
Table 3, which presents the counts of individual 
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hexadecimal digits among the first trillion hex 
digits, as obtained by Yasumasa Kanada. 

Given some positive integer b, areal number « is 
said to be b-normal if every m-long string of base-b 
digits appears in the base-b expansion of « with 
precisely the expected limiting frequency 1/b"™. It 
follows from basic probability theory that almost 
all real numbers are b-normal for any specific 
base b and even for all bases simultaneously. But 
proving normality for specific constants of interest 
in mathematics has proven remarkably difficult. 

Interest in BBP-type formulas was heightened by 
the 2001 observation, by one of the present authors 
(Bailey) and Richard Crandall, that the normality 
of BBP-type constants such as 1, 7, log 2 and G 
can be reduced to a certain hypothesis regarding 
the behavior of a class of chaotic iterations [5, 
pp. 141-173]. No proof is known for this general 
hypothesis, but even specific instances of this 
result would be quite interesting. For example, if 
it could be established that the iteration given by 
Wo = 0 and 


(19) Wh = (2-1 + “) mod 1 


is equidistributed in [0, 1) (ie., is a “good” pseu- 
dorandom number generator), then, according to 
the Bailey-Crandall result, it would follow that 
log 2 is 2-normal. In a similar vein, if it could be 
established that the iteration given by x9 = 0 and 


(20) Xn = (16% 


r 120n? — 89n + 16 
512n4 — 1024n3 + 712n2 — 206n+ 21 


is equidistributed in [0,1), then it would follow 
that 7 is 2-normal. 

Giving further hope to these studies is the recent 
extension of these methods to a rigorous proof 
of normality for an uncountably infinite class of 
real numbers. Given a real number r in [0, 1), let 
r, denote the k-th binary digit of r. Then the real 
number 


) moa 1 


(21) 023(r) = > 

k=0 
is 2-normal. For example, the constant «2,3(0) = 
X<keo 1/(3k23") = 0.541883680831502985... is 
provably 2-normal. A similar result applies if 2 
and 3 in this formula are replaced by any pair 
of coprime integers (b,c) greater than one [5, 
pp. 141-173]. 


1 
3k 23k +r, 


A Curious Hexadecimal Conjecture 


It is tantalizing that if, using (20), one calculates 
the hexadecimal digit sequence 


(22) Yn = |16xn] 
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Table 3. Digit counts in the first trillion 
hexadecimal (base-16) digits of 7. Note that 
deviations from the average value 
62,500,000,000 occur only after the first six 
digits, as expected. 


Occurrences 
62499881108 
62500212206 
62499924780 
62500188844 
62499807368 
62500007205 
62499925426 
62499878794 
62500216752 
62500120671 
62500266095 
62499955595 
62500188610 
62499613666 
62499875079 
62499937801 

1000000000000 


Hex Digit 


Total 


(where |-| denotes greatest integer), then the 
sequence (yn) appears to perfectly (not just 
approximately) produce the hexadecimal expansion 
of tr. In explicit computations, we checked that 
the first 10,000,000 hexadecimal digits generated 
by this sequence are identical with the first 
10,000,000 hexadecimal digits of tr — 3. This is a 
fairly difficult computation, as it requires roughly 
n? bit-operations and is not easily performed on a 
parallel computer system. In our implementation, 
computing 2,000,000 hex digits with (22) using 
Maple, required 17.3 hours on a laptop. Computing 
4,100,000 using Mathematica, with a more refined 
implementation, required 46.5 hours. The full 
confirmation using a C++ program took 433,192 
seconds (120.3 hours) on an IBM Power 780 
(model: 9179-MHB, clock speed: 3.864 GHz). All 
these outputs were confirmed against stored hex 
digits of m in the software section of http: 
//www.experimentalmath. info. 


Conjecture 1. The sequence |16Xn|, where (Xn) is 
the sequence of iterates defined in equation (20), 
generates precisely the hexadecimal expansion of 
T= 3: 


We can learn more. Let ||x — y|| = min(|x — 


y|,|1 — (x-y)]) denote the “wrapped” distance 
between reals x and y in [0,1). The base-16 
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expansion of 7, which we denote Ttp, satisfies 


(23) 
||Tn — Xnll 
. ; 120k? —89k+16 
* pe LOK” (512k4—1024k3 +712k2— 206k +21) 
7 1 
~ 64(n+ 1)2’ 


so that ,upon summing from some N to infinity, 
we obtain the finite value 


< 1 
(24) > [| — Xnl| < 
n=N 


64(N +1)" 


Heuristically, let us assume that the 7, are inde- 
pendent, uniformly distributed random variables 
in (0,1), and let 6, = ||&, — Xp||. Note that an 
error (i.e., an instance where xy lies in a subinterval 
of the unit interval different from 7r, so that the 
corresponding hex digits don’t match) can only 
occur when Tt is within 6, of one of the points 
(0, 1/16, 2/16,...,15/16). Since XxX, < Tt for all n 
(where < is interpreted in the wrapped sense when 
Xn is slightly less than one), this event has proba- 
bility 166,. Then the fact that the sum (24) has a 
finite value implies that, by the first Borel-Cantelli 
lemma, there can only be finitely many errors. 
Further, the small value of the sum (24), even when 
N = 1, suggests that it is unlikely that any errors 
will be observed. If we set N = 10,000, 001 in (24), 
since we know there are no errors in the first 
10,000,000 elements, we obtain an upper bound 
of 1.563 x 10~°, which suggests it is truly unlikely 
that errors will ever occur. 

A similar correspondence can be seen between 
iterates of (19) and the binary digits of log 2. In 
particular, let Z, = |2Wn]|, where Wy is given in (19). 
Then since the sum of the error terms for log 2, 
corresponding to (24), is infinite, it follows by the 
second Borel-Cantelli lemma that discrepancies 
between (z,,) and the binary digits of log 2 can be 
expected to appear indefinitely but with decreasing 
frequency. Indeed, in computations that we have 
done, we have found that the sequence (Zn) 
disagrees with ten of the first twenty binary digits 
of log 2, but in only one position over the range 
5,000 to 8,000. 


Computing Digits of 7r* and Catalan’s 
Constant 

In illustration of this theory, we now present the 
results of computations of high-index binary digits 
of 7°, ternary digits of 7°, and binary digits of 
Catalan’s constant, based on formulas (13), (14), 
and (18), respectively. These calculations were 
performed on a 4-rack BlueGene/P system at IBM’s 
Benchmarking Center in Rochester, Minnesota (see 
Figure 4). This is a shared facility, so calculations 
were conducted over a several-month period during 
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Figure 4. Andrew Mattingly, Blue Gene/P, and 
Glenn Wightwick. 


which time, none, some, or all of the system 
was available. It was programmed remotely from 
Australia, which permitted the system to be used 
off-hours. Sometimes it helps to be in a different 
time zone! 

(1) Base-64 digits of tt? beginning at posi- 
tion 10 trillion. The first run, which 
produced base-64 digits starting from 
position 10!* — 1, required an average 
of 253,529 seconds per thread and was 
subdivided into seven partitions of 2048 
threads each, so the total cost was 
7» 2048 - 253529 = 3.6 x 109 CPU-seconds. 
Each rack of the IBM system features 4096 
cores, so the total cost is 10.3 “rack-days”. 
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The second run, which produced base-64 
digits starting from position 10!*, com- 
pleted in nearly the same run time (within 
a few minutes). The two resulting base-8 
digit strings appear in row A of Table X. 
(Each pair of base-8 digits corresponds 
to a base-64 digit.) Here the digits in 
agreement are delimited by |. Note that 53 
consecutive base-8 digits (or, equivalently, 
159 consecutive binary digits) are in 
perfect agreement. 

Base-729 digits of tt? beginning at position 
10 trillion. In this case the two runs each 
required an average of 795,773 seconds 
per thread, similarly subdivided as above, 
so that the total cost was 6.5 x 109 CPU- 
seconds, or 18.4 “rack-days”. The two 
resulting base-9 digit strings are found in 
row B of Table X. (Each triplet of base-9 
digits corresponds to one base-729 digit.) 
Note here that 47 consecutive base-9 digits 
(94 consecutive base-3 digits) are in perfect 
agreement. 

Base-4096 digits of Catalan’s constant be- 
ginning at position 10 trillion. These two 
runs each required 707,857 seconds per 
thread, but in this case they were subdi- 
vided into eight partitions of 2048 threads 
each, so that the total cost was 1.2 x 10!° 
CPU-seconds, or 32.8 “rack-days”. The two 
resulting base-8 digit strings are found in 
row C of Table X. (Each quadruplet of base-8 
digits corresponds to one base-4096 digit.) 
Note that 47 consecutive base-8 digits (141 
consecutive binary digits) are in perfect 
agreement. 


These long strings of consecutively agreeing 
digits, beginning with the target digit, provide a 
compelling level of statistical confidence in the 
results. In the first case, for instance, note that 
the probability that thirty-two pairs of randomly 
chosen base-8 digits are in perfect agreement is 
roughly 1.2 x 10-29. Even if one discards, say, the 
final six base-8 digits as a 1-in-262,144 statistical 
safeguard against numerical round-off error, one 
would still have twenty-four consecutive base-8 
digits in perfect agreement, with a corresponding 
probability of 2.1 x 10~. Now strictly speaking, 
one cannot define a valid probability measure on 
digits of m2, but nonetheless, from a practical 
point of view, such analysis provides a very high 
level of statistical confidence that the results have 
been correctly computed. 

For this reason, computations of 7 and the 
like are a favorite tool for the integrity testing for 
computer system hardware and software. If either 
run of a paired computation of 7m succumbs to 
even a single fault in the course of the computation, 


(2 


~— 


=~ 
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600 


700 it) 


Figure 5. A “random” walk ona million digits of 
Catalan’s constant. 


then typically the final results will disagree almost 
completely. For example, in 1986 a similar pair 
of computations of 7 disclosed some subtle but 
substantial hardware errors in an early model of the 
Cray-2 supercomputer. Indeed, the calculations we 
have done arguably constitute the most strenuous 
integrity test ever performed on the BlueGene/P 
system. Table 4 gives some sense of the scale of the 
three record computations, which used more than 
135 “rack-days”, 1378 serial CPU-years, and more 
than 1.549 x 10!9 floating point operations. This is 
comparable to the cost of the most sophisticated 
animated movies as of the present time (2011). 

For the sake of completeness, in Table 5 we 
also record the one-, two-, and three-bit frequency 
counts from our Catalan computation. 


Future Directions 
It is ironic that, in an age when even pillars 
such as Fermat’s Last Theorem and the Poincaré 
conjecture have succumbed to the brilliance of 
modern mathematics, one of the most elementary 
mathematical hypotheses, namely whether (and 
why) the digits of 7m or other constants such 
as log 2,7’, or G (see Figure 5) are “random”, 
remains unanswered. In particular, proving that 
Tt (or log 2, 1r*, or G) is b-normal in some integer 
base b remains frustratingly elusive. Even much 
weaker results, for instance the simple assertion 
that a one appears in the binary expansion of tr (or 
log 2, 1r*, or G) with limiting frequency 1/2 (which 
assertion has been amply affirmed in numerous 
computations over the years), remain unproven 
and largely inaccessible at the present time. 
Almost as much ignorance extends to simple 
algebraic irrationals such as ,/2. In this case it 
is now known that the number of ones in the 
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Table 4. (A) base-4 digits of 7, (B) base-729 digits of 1r°, and (C) base-4096 digits of Catalan’s 
constant, in each case beginning at position 10 trillion. 


A 75 | 60114505303236475724500005743262754530363052416350634 | 573227604 
|60114505303236475724500005743262754530363052416350634 | 220210566 

B 001] 12264485064548583177111135210162856048323453468 | 10565567635862 
| 12264485064548583177111135210162856048323453468 | 04744867134524 

Cc 0176 | 34705053774777051122613371620125257327217324522 | 6000177545727 
| 34705053774777051122613371620125257327217324522 | 5703510516602 


Table 5. The scale of our computations. We estimate 4.5 quad-double operations per iteration and 
that each costs 266 single-precision operations. The total cost in single-precision operations is given 
in the last column. This total includes overhead which is largely due to a rounding operation that we 


implemented using bit-masking. 


#ITERS TIME/ITER TIME WITH TOTAL O'’HEAD FLOPS 
CONSTANT nn’ 3 d- (x10!) (microsec) (yr) VERIFY (yr) (%) (x1018) 
Tr? base-2® 5 1038 2.16 1.424 97.43 194.87 230.35 18.2 2.58 
Tr base-3® 9 108 3.89 1.424 175.38 350.76 413.16 17.8 4.65 
Gbase-45 16 10/3 6.91 1.424 311.79 623.58 735.02 17.9 8.26 
Table 6. Base-4096 digits of G beginning at position 10 trillion: digit proportions. 
Digit 0 1 2 3 4 5 6 ? 
base-2 (141) | 0.454 0.546 “ - - - 7 7 
base-4 (70) | 0.171 0.329 0.229 0.271 = - - - 
base-8 (47) | 0.085 0.128 0.213 0.128 0.064 0.128 0.043 0.213 


first n binary digits of ./2 must be at least of 
the order of \/n, with similar results for other 
algebraic irrationals [5, pp. 141-173]. But this is 
a very weak result, given that this limiting ratio 
is almost certainly 1/2, not only for /2 but more 
generally for all algebraic irrationals. 

Nor can we prove much about continued frac- 
tions for various constants, except for a few 
well-known results for special cases such as 
quadratic irrationals, ratios of Bessel functions, 
and certain expressions involving exponential 
functions. 

For these reasons there is continuing interest 
in the theory of BBP-type constants, since, as men- 
tioned, there is an intriguing connection between 
BBP-type formulas and certain chaotic iterations 
that are akin to pseudorandom number generators. 
If these connections can be strengthened, then 
perhaps normality proofs could be obtained for a 
wide range of polylogarithmic constants, possibly 
including 7, log 2, 7, and G. 

As settings change, so do questions. Until the 
question of efficient single-digit extraction was 
asked, our ignorance about such issues was not 
exposed. The case of the exponential series 


© An 
xX 
a= > —_ 
n! 
n=0 


(25) 
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is illustrative. For present purposes, the conver- 
gence rate in (25) is too good. 


Conjecture 2. There is no BBP formula for e. More- 
over, there is no way to extract individual digits of 
e significantly more rapidly than by computing the 
first n digits. 


The same could be conjectured about 
other numbers, including Euler’s constant 
y = 0.57721566490153.... In short, until vastly 
stronger mathematical results are obtained in this 
area, there will doubtless be continuing interest in 
computing digits of these constants. In the present 
vacuum, that is perhaps all that we can do. 
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Synopsis: 

As mentioned earlier, an age-old question (one that has been the motivation 
for both ancient and computer-age computations of 7 and other constants) is that 
of whether (and why) these digits are “random,” which is usually taken to be the 
property of normality: a number a is said to be b-normal if every m-long string in 
the base-b expansion of a appears, in the limit, with frequency precisely 1/b™. It 
is straightforward to show, using probability and/or measure theory, that almost 
all real numbers must be normal, but it has been very difficult to prove normality 
for any of the classical constants of mathematics. 

In this paper, the authors apply some new computer-based tools to address 
these questions. One of these is to cast, say, the base-4 digits of a constant, as a 
random walk, where at each step one moves a unit right, up, left or down depending 
on whether the digit at the given position is 0, 1, 2 or 3 (with similar extensions to 
other number bases). These analyses produce some surprising results, showing, for 
instance, that although the Stoneham numbers are provably normal, they exhibit 
self-similarity properties atypical of a truly “random” sequence. Numerous results 
along this line are presented. 


Keywords: Curiosities, Graphical Representation, Normality, Random Walks 
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Walking on Real 


Numbers 


Francisco J. ARAGON Artacuo, Davip H. Bailey, JoNATHAN M. Borwein, 


AND Peter B. Borwein 


TI Mhe digit expansions of 2, e, V2, and other mathemat- 
ical constants have fascinated mathematicians from 
the dawn of history. Indeed, one prime motivation 

for computing and analyzing digits of is to explore the 

age-old question of whether and why these digits appear 

“random.” The first computation on ENIAC in 1949 of z to 

2037 decimal places was proposed by John von Neumann 

so as to shed some light on the distribution of x (and of e) 

(15, pg. 277-281]. 

One key question of some significance is whether (and 
why) numbers such as zand eare “normal.” A real constant is 
b-normal if, given the positive integer b > 2, every m-long 
string of base-b digits appears in the base-b expansion of « 
with precisely the expected limiting frequency 1/b”. It is a 
well-established, albeit counterintuitive, fact that given an 
integer b > 2, almost all real numbers, in the measure theory 
sense, are b-normal. What’s more, almost all real numbers are 
b-normal simultaneously for all positive integer bases (a 
property known as “absolutely normal’). 

Nonetheless, it has been surprisingly difficult to prove 
normality for well-known mathematical constants for any 
given base b, much less all bases simultaneously. The first 
constant to be proven 10-normal is the Champernowne 
number, namely the constant 0.12345678910111213141516. .., 
produced by concatenating the decimal representation of all 
positive integers in order. Some additional results of this sort 
were established in the 1940s by Copeland and Erdés [26]. 

At present, normality proofs are not available for any well- 
known constant suchas 7, e, log 2, 2. We do not even know, 
say, that a 1 appears one-half of the time, in the limit, in the 
binary expansion of V2 (although it certainly appears to), nor 
do we know for certain that a 1 appears infinitely often in the 
decimal expansion of 2. For that matter, it is widely believed 


that every irrational algebraic number (..e., every irrational root 
of an algebraic polynomial with integer coefficients) is 
b-normal to all positive integer bases b, but there is no proof, 
not for any specific algebraic number to any specific base. 

In 2002, one of the present authors (Bailey) and Richard 
Crandall showed that given a real number r in [0,1), with 7 
denoting the &-th binary digit of 7, the real number 


= 1 
2,.3(7) 2= Eee: (1) 


k=1 ~ 


is 2-normal. Itcan be seen thatifr # s, then a23(7) 4 o%23(s), 
so that these constants are all distinct. Since r can range over 
the unit interval, this class of constants is uncountable. So, for 
example, the constant 023 = %23(0) = Y,s, 1/(3*2) = 
0.0418836808315030... is provably 2-normal (this special 
case was proven by Stoneham in 1973 [43]). A similar result 
applies if 2 and 3 in formula (1) are replaced by any pair of 
coprime integers (b, c) with b > 2 and c > 2 [10]. More 
recently, Bailey and Michal Misiurewicz were able to 
establish 2-normality of #23 by a simpler argument, by uti- 
lizing a “hot spot” lemma proven using ergodic theory 
methods [11]. 

In 2004, two of the present authors (Bailey and Jonathan 
Borwein), together with Richard Crandall and Carl Pomer- 
ance, proved the following: If a positive real y has algebraic 
degree D > 1, then the number #(y, N) of 1-bits in the binary 
expansion of ythrough bit position N satisfies #(y, N) > CN”, 
for a positive number C (depending on y) and all sufficiently 
large N [5]. A related result has been obtained by Hajime 
Kaneko of Kyoto University in Japan [37]. However, these 
results fall far short of establishing b-normality for any irra- 
tional algebraic in any base b, even in the single-digit sense. 
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1 Twenty-First Century Approaches 

to the Normality Problem 
In spite of such developments, there is a sense in the field that 
more powerful techniques must be brought to bear on this 
problem before additional substantial progress can be 
achieved. One idea along this line is to study directly the 
decimal expansions (or expansions in other number bases) of 
various mathematical constants by applying some techniques 
of scientific visualization and large-scale data analysis. 

In a recent paper [4], by accessing the results of several 
extremely large recent computations [46, 47], the authors tes- 
ted the first roughly four trillion hexadecimal digits of x by 
a Poisson process model: in this model, it is 
extraordinarily unlikely that z is not normal base 16, given its 
initial segment. During that work, the authors of [4], like many 
others, investigated visual methods of representing their large 
mathematical data sets. Their chosen tool was to represent 
these data as walks in the plane. 

In this work, based in part on sources such as [22, 23, 21, 19, 
14], we make a more rigorous and quantitative study of these 
walks on numbers. We pay particular attention to 2, for which 
we have copious data and which—despite the fact that its 
digits can be generated by simple algorithms—behaves 
remarkably “randomly.” 

The organization of the article is as follows. In Section 2 we 
describe and exhibit uniform walks on various numbers, both 
rational and irrational, artificial and natural. In the next two 
sections, we look at quantifying two of the best-known fea- 
tures of random walks: the expected distance traveled after 
Nsteps (Section 3) and the number of sites visited (Section 4) 
In Section 5 we describe two classes for which normality and 
nonnormality results are known, and one for which we have 
only surmise. In Section 6 we show various examples and 


means O. 


leave some open questions. Finally, in Appendix 7 we collect 
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the numbers we have examined, with concise definitions and 
a few digits in various bases. 


2 Walking on Numbers 


2.1 Random and Deterministic Walks 

One of our tasks is to compare deterministic walks (such as 
those generated by the digit expansion of a constant) with 
pseudorandom walks of the same length. For example, in 
Figure 1 we draw a uniform pseudorandom walk with one 
million base-4 steps, where at each step the path moves one 
unit east, north, west, or south, depending on the whether the 
pseudorandom iterate at that position is 0, 1, 2, or 3. The color 
indicates the path followed by the walk—it is shifted up the 
spectrum (red-orange-yellow-green-cyan-blue-purple-red) 


following an HSV scheme with Sand V equal to one. The HSV 
chue, saturation, and value) model is a cylindrical-coordinate 
representation that yields a rainbow-like range of colors. 
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Let us now compare this graph with that of some rational 
numbers. For instance, consider these two rational numbers 


Ql and Q2: 
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345 


But even more information is exhibited when we view a 
plot of the base-4 digits of Q1 and Q2 as deterministic 


walks, as shown in Figure 2. Here, as above, at each step 


Qi= 
1049012271677499437486619280565448601617567358491560876166848380843 
144358447 25287555162924702775955557 04537 156793130587832477297720217 
7081818796590637365767487981422801328592027861019258140957 135748704 
71229026746515131280595419539975042020613803738223389597 13391954 

/ 
1612226962694290912940490066273549214229880755725468512353395718465 
1913530173488143140175045399694454793530120643833272670970079330526 
2920303509209736004509554561365966493250783914647728401623856513742 
9529453089612268152748875615658076162410788075184599421938774835 


Q2= 
7278984857066874130428336124347 736557 760097920257 997 246066053320967 
1510416153622193809833373062647935595578496622633151106310912260966 
756877897 7976821682512653537303069288477901523227013159658247897670 
30435402490295493942131091063934014849602813952 

/ 
1118707184315428172047608747409173378543817936412916114431306628996 
525937709097818724425 1666337745459 15209355828867 1765654061273733231 
7877736113382974861639142628415265543797274479692427652260844707187 
532155254872952853725026318685997495262134665215 


At first glance, these numbers look completely dissimilar. 
However, if we examine their digit expansions, we find that 
they are very close as real numbers: the first 240 decimal digits 
are the same, as are the first 400 base-4 digits. 
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the path moves one unit east, north, west, or south, 
depending on the whether the digit in the corresponding 
position is 0, 1, 2, or 3, and with color coded to indicate the 
overall position in the walk. 
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(a) A 440-step walk on Q1 base 4. 


Figure 2. Walks on the rational numbers Q1 and Q2. 


Q __ 3624360069 
(a) Q3 ~~ 7000000001 


(b) An 8240-step walk on Q2 base 4. 


(b) Q4= 123456789012 


1000000000061 


Figure 3. Walks on the first million base-10 digits of the rationals Q3 and Q4 from [39]. 


The rational numbers Q1 and Q2 represent the two possi- 
bilities when one computes a walk on a rational number: 
either the walk is bounded as in Figure 2(a) (for any walk with 
more than 440 steps one obtains the same plot), or it is 
unbounded but repeating some pattern after a finite number 
of digits as in Figure 2(b). 

Of course, not all rational numbers are that easily identified 
by plotting their walks. It is possible to create a rational 
number whose period is of any desired length. For example, 
the following rational numbers from [39], 


_ 3624360069 o4= 123456789012 


05 = 7000000001 ~ 1000000000061’ 


have base-10 periodic parts with length 1,750,000,000 and 
1,000,000,000,060, respectively. A walk on the first million 
digits of both numbers is plotted in Figure 3. These huge 
periods derive from the fact that the numerators and 
denominators of Q3 and Q4 are relatively prime, and the 
denominators are not congruent to 2 or 5. In such cases, the 
period P is simply the discrete logarithm of the denomi- 
nator D modulo 10; or, in other words, P is the smallest 
n such that 10” mod D = 1. 

Graphical walks can be generated in a similar way for other 
constants in various bases—see Figures 2 through 7. Where 
the base b > 3, the base-b digits can be used to a select, as a 
direction, the corresponding base-b complex root of unity—a 
multiple of 120° for base three, a multiple of 90° for base four, a 
multiple of 72° for base 5, etc. We generally treat the case 
b = 2asa base-4 walk, by grouping together pairs of base-2 
digits (we could render a base-2 walk on a line, but the 
resulting images would be much less interesting). In Figure 4 
the origin has been marked, but since this information is not 
that important for our purposes, and can be approximately 
deduced by the color in most cases, it is not indicated in the 
others. The color scheme for Figures 2 through 7 is the same as 


Figure 4. A million-step base-4 walk on e. 


the above, except that Figure 6 is colored to indicate the 
number of returns to each point. 


2.2 Normal Numbers as Walks 

As noted previously, proving normality for specific constants 
of interest in mathematics has proven remarkably difficult. 
The tenor of current knowledge in this arena is illustrated by 
[45, 14, 34, 38, 40, 39, 44]. It is useful to know that, while small 
in measure, the “absolutely abnormal” or “absolutely non- 
normal” real numbers (namely those that are not b-normal for 
any integer b) are residual in the sense of topological category 
[1]. Moreover, the Hausdorff—Besicovitch dimension of the set 
of real numbers having no asymptotic frequencies is equal to 
1. Likewise the set of Liouville numbers has measure zero but 
is of the second category [18, p. 352]. 
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Figure 5. A walk on the first 100 billion base-4 digits of 1 
(normal?). 


Figure 6. A walk on the first 100 million base-4 digits of 7, col- 
ored by number of returns (normal?). Color follows an HSV model 
(green-cyan-blue-purple-red) depending on the number of returns 
to each point (where the maxima show a tinge of pink/red). 


One question that has possessed mathematicians for cen- 
turies is whether z is normal. Indeed, part of the original 
motivation of the present study was to develop new tools for 
investigating this age-old problem. 

In Figure 5 we show a walk on the first 100 billion base-4 
digits of x. This may be viewed dynamically in more detail 
online at http://gigapan.org/gigapans/106803, where the full- 
sized image has a resolution of 372,224 x 290,218 pixels 
(108.03 gigapixels in total). This must be one of the largest 
mathematical images ever produced. The computations for 
creating this image took roughly a month, where several parts 
of the algorithm were run in parallel with 20 threads on 
CARMA’s MacPro cluster. 

By contrast, Figure 6 exhibits a 100 million base-4 walk on 
m, where the color is coded by the number of returns to the 
point. In [4], the authors empirically tested the normality of its 
first roughly four trillion hexadecimal (base-16) digits using a 
Poisson process model, and they concluded that, according to 
this test, it is “extraordinarily unlikely” that z is not 16-normal 
(of course, this result does not pretend to be a proof). 

In what follows, we propose various methods of analyzing 
real numbers and visualizing them as walks. Other methods 
widely used to visualize numbers include the matrix repre- 
sentations shown in Figure 8, where each pixel is colored 
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depending on the value of the digit to the right of the decimal 
point, following a left-to-right up-to-down direction (in base 4 
the colors used for 0, 1, 2, and 3 are red, green, cyan, and 
purple, respectively). This method has been mainly used to 
visually test “randomness.” In some cases, it clearly shows the 
features of some numbers; as for small periodic rationals, see 
Figure 8(c). This scheme also shows the nonnormality of the 
number 2 3—see Figure 8(d) (where the horizontal red bands 
correspond to the strings of zeroes)—and it captures some of 
the special peculiarities of the Champernowne’s number C, 
(normal) in Figure 8(e). Nevertheless, it does not reveal 
the apparently nonrandom behavior of numbers such 
as the Erdés—Borwein constant; compare Figure 8(f) with 
Figure 7(e). See also Figure 21. 

As we will see in what follows, the study of normal numbers 
and suspected normal numbers as walks will permit us to 
compare them with true random (or pseudorandom) walks, 
obtaining in this manner a new way to empirically test “ran- 
domness” in their digits. 


3 Expected Distance to the Origin 
Let b € {3,4,...} bea fixed base, and let X1, X2,X3,. 
sequence of independent bivariate discrete random variables 
whose common probability distribution is given by 


p(x= (Srey )) =3 fork=1,...8 (2) 


Then the random variable S*:= ye 1 X represents a 
base-b random walk in the plane of N steps. 

The following result on the asymptotic expectation of the 
distance to the origin of a base-b random walk is probably 


. bea 


known, but being unable to find any reference in the litera- 
ture, we provide a proof. 


THEOREM 3.1 The expected distance to the origin of a 
base-b random walk of N steps is asymptotically equal 


to /nN/2. 

PRooF. By the multivariate central limit theorem, the ran- 

dom variable 1/V/NS°Y_,(Xm— 2) is asymptotically 
ja baae ' 0 : i 

bivariate normal with mean 6 and covariance matrix M, 


where p is the two-dimensional mean vector of X and M is its 
2 x 2 covariance matrix. By applying Lagrange’s trigono- 
metric identities, one obtains 


sin((b+1/2)#4) 
n= b = cos (2k) aa —3+ 2sin(n/b) 
~ b 2 ~b cos 2)2 
i eas Sin ($k) 4 4cot(z/b) — late i) 
0 
= : 3 
& ‘3 
Thus, 
An 1 es cos? (72k) : cos (2k) sin (22k) 


~D Spam cos(2R) sin (72k) 
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(a) A million-step walk on a2,3 
base 3 (normal?). 


(c) A million-step walk on 
a2,3 base 2 (normal). 


(e) A million-step walk on 
EB(2) base 4 (normal?). 


Figure 7. Walks on various numbers in different bases. 


Since 
b b 1+ cos(itk 
cont (284) = y>t toon) _ 6 
k=l b k=l 2 2 
b a An 
- i? (Fe 3 cos( ) i 
k=1 b k=1 2 2 
b b dnp 
LS cos (F*) sin (F*) > ain( ) 0, 
k=1 b b mi 


(b) A 100,000-step walk on a2,3 
base 6 (nonnormal). 


a 


(d) A 100,000-step walk on 
Champernowne’s number C4 
base 4 (normal). 


(f) A million-step walk on CE(10) 
base 4 (normal?). 


formula (4) reduces to 


1 
u=|j 


NEO 


| (6 


Hence, 8‘ /\/N is asymptotically bivariate normal with mean 


0 : : ‘ 
0 and covariance matrix M. Because its components 


Si JN SN /./N T are uncorrelated, then they are inde- 
1 2 y 
pendent random variables, whose distribution is (univariate) 
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(a) 7 base 4 


(b) (Pseudo)random number base 4 


(c) The rational number Q1 base 4 


(e) Champernowne’s number C4 
base 4 


(d) a2,3 base 6 


(f) EB(2) base 4 


Figure 8. Horizontal color representation of a million digits of various numbers. 


normal with mean 0 and variance 1/2. Therefore, the random 
variable 


V2 ow)? (V2 ow (7) 
(Fast) +(Fe) 
converges in distribution to a x random variable with two 


degrees of freedom. Then, for N sufficiently large, 
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FF) -e( (a) Ee) 
_ VNTG/2) _ 
2 ry) 2 ®) 


where E(-) stands for the expectation of a random 
variable. Therefore, the expected distance to the 
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Table 1. Average of the normalized distance to the origin of the walk of 
various constants in different bases 


Number Base Steps Average normalized — Normal 
distance to the origin 
Mean of 10,000 4 1,000,000 1.00315 Yes 
random walks 
Mean of 10,000 walks 4 1,000,000 1.00083 2 
on the digits of x 

2,3 3 4,000,000 0.89275 2 
ap 3 4 4,000,000 0.25901 Yes 
ap 3 5 4,000,000 0.88104 2 
ap 3 6 4,000,000 108.02218 No 
ta,3 3 4,000,000 1.07223 2 
aa,3 4 4,000,000 0.24268 Yes 
4,3 6 1,000,000 94.54563 No 
4,3 12 1,000,000 371.24694 No 
03,5 3 1,000,000 0.32511 Yes 
03,5 5 1,000,000 0.85258 2? 
09,5 15 1,000,000 370.93128 No 
Tt 4 1,000,000 0.84366 ? 

Tt 6 1,000,000 0.96458 ? 

Tt 10 1,000,000 0.82167 ? 

ua 10 10,000,000 0.56856 2? 

ua 10 100,000,000 0.94725 ? 

1 10 1,000,000,000 0.59824 ? 

e 4 000,000 0.59583 ? 
v2 4 ,000,000 0.72260 2 
log 2 4 000,000 = 1.21113 ? 
Champernowne Cig = 10 ,000,000 59.91143 Yes 
EB(2) 4 000,000 6.95831 ? 
CE(10) 4 000,000 0.94964 ? 
Rational number Q,; 4 ,000,000 0.04105 No 
Rational number Qs 4 ,000,000 58.40222 No 
Euler constant 10 ,000,000 1.17216 2 
Fibonacci F 10 ,000,000 1.24820 ? 
(2) == 4 ,000,000 1.57571 ? 
(3) 4 000,000 1.04085 ? 
Catalan’s constantG 4 ,000,000 0.53489 2 
Thue-Morse TMo 4 ,000,000 531.92344 No 
Paper-folding P 4 1,000,000 0.01336 No 


origin of the random walk is asymptotically equal to 


VaN/2. 


As a consequence of this result, for any random walk of 
Nsteps in any given base, the expectation of the distance to the 
origin multiplied by 2//xN (which we will call normalized 
distance to the origin) must approach 1 as N goes to infinity. 
Therefore, for a “sufficiently” big random walk, one would 
expect that the arithmetic mean of the normalized distances 


Table 2. Number of points visited in various \V-step base-4 walks. The 
values of the two last columns are upper and lower bounds on the 
expectation of the number of distinct sites visited during an N-step 
random walk; see [31, Theorem 2] and [32] 


Number Steps Sites visited Bounds on the expectation 
of sites visited by a random 
walk 
Lower Upper 
bound bound 
Mean of 10,000 1,000,000 202,684 199,256 203,060 
random walks 
Mean of 10,000 1,000,000 202,385 199,256 203,060 
walks on the 
digits of x 
02,3 1,000,000 95,817 199,256 203,060 
4,3 1,000,000 68,613 199,256 203,060 
3,2 1,000,000 195,585 199,256 203,060 
1 1,000,000 204,148 199,256 203,060 
Tt 10,000,000 1,933,903 1,738,645 1,767,533 
Tt 100,000,000 16,109,429 15,421,296 15,648,132 
Tt 1,000,000,000 138,107,050 138,552,612 140,380,926 
e 1,000,000 176,350 199,256 203,060 
v2 1,000,000 200,733 199,256 203,060 
log 2 1,000,000 214,508 199,256 203,060 
Champernowne 1,000,000 548,746 199,256 203,060 
Cy 
EB(2) 1,000,000 279,585 199,256 203,060 
CE(10) 1,000,000 190,239 199,256 203,060 
Rational number 1,000,000 378 199,256 203,060 
Q, 
Rational number 1,000,000 939,322 199,256 203,060 
Qe 
Euler constant y 1,000,000 208,957 199,256 203,060 
(2) 1,000,000 188,808 199,256 203,060 
(3) 1,000,000 221,598 199,256 203,060 
Catalan’s 1,000,000 195,853 199,256 203,060 
constant G 
TM2 1,000,000 1,000,000 199,256 203,060 
Paper-folding P 1,000,000 21 199,256 203,060 


(which will be called average normalized distance to the 
origin) should be close to 1. 

We have created a sample of 10,000 (pseudo)random 
walks base-4 of one million points each in Python', and we 
have computed their average normalized distance to the ori- 
gin. The arithmetic mean of these numbers for the mentioned 
sample of pseudorandom walks is 1.0031, whereas its stan- 
dard deviation is 0.3676, so the asymptotic result fits quite well. 
We have also computed the normalized distance to the origin 
of 10,000 walks of one million steps each generated by the first 
ten billion digits of z. The resulting arithmetic mean is 1.0008, 
whereas the standard deviation is 0.3682. In Table 1 we show 
the average normalized distance to the origin of various 
numbers. There are several surprises in there data, such as the 


‘Python uses the Mersenne Twister as the core generator and produces 53-bit precision floats, with a period of 219°” — 1 = 10°°?, Compare the length of this period to the 
comoving distance from Earth to the edge of the observable universe in any direction, which is approximately 4.6 - 10°” nanometers, or to the number of protons in the universe, 


which is approximately 10°. 
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(a) (Pseudo)random walks. 
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(b) Walks based on the first 10 billion digits of 7. 
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Figure 9. Number of points visited by 10* base-4 million-step walks. 


fact that by this measure, Champernowne’s number C}po is far 
from what is expected of a truly “random” number. 


4 Number of Points Visited during an N-Step 
base-4 Walk 

The number of distinct points visited during a walk of a given 
constant (on a lattice) can be also used as an indicator of how 
“random” the digits of that constant are. It is well known that 
the expectation of the number of distinct points visited by an 
N-step random walk on a two-dimensional lattice is asymp- 
totically equal to z N/log(); see, for example, [36, pg. 338] or 
(13, pg. 27]. This result was first proven by Dvoretzky and 
Erdés [33, Thm. 1]. The main practical problem with this 
asymptotic result is that its convergence is rather slow; spe- 
cifically, it has order of O(N log log N/(logN)’). In (31, 32], 
Downham and Fotopoulos show the following bounds on the 
expectation of the number of distinct points, 


( m(N + 0.84) 


1.162 — 1 — log 2 + log(N + 2)’ 


(9) 


a(N +1) 

1.0662 — 1 — log2 + log(N + 1) }’ 
which provide a tighter estimate on the expectation than the 
asymptotic limit z N/log(N). For example, for N = 10°, these 
bounds are (199256.1, 203059.5), whereas 2N/log(N) = 
227396, which overestimates the expectation. 

In Table 2 we have calculated the number of distinct points 
visited by the base-4 walks on several constants. One can see 
that the values for different step walks on z fit quite well the 
expectation. On the other hand, numbers that are known to be 
normal such as #3 or the base-4 Champernowne number 
substantially differ from the expectation of a random walk. 
These constants, despite being normal, do not have a “ran- 
dom” appearance when one draws the associated walk, see 
Figure 7(d). 

At first look, the walk on «3 might seem random, see 
Figure 7(c). A closer look, shown in Figure 12, shows a more 
complex structure: the walk appears to be somehow self- 
repeating. This helps explain why the number of sites visited 
by the base-4 walk on 23 or 043 is smaller than the 
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expectation for a random walk. A detailed discussion of the 
Stoneham constants and their walks is provided in Section 5.2, 
where the precise structure of Figure 12 is conjectured. 


5 Copeland-Erdos, Stoneham, and Erdds— 
Borwein Constants 

As well as the classical numbers—such as e, 1, y—listed in the 

Appendix, we also considered various other constructions, 

which we describe in the next three subsections. 


5.1 Champernowne Number and Its Concatenated 
Relatives 

The first mathematical constant proven to be 10-normal is 
the Champernowne number, which is defined as the concat- 
enation of the decimal values of the positive integers, that 
is, Cio = 0.12345678910111213141516... Champernowne 
proved that Cio is 10-normal in 1933 [24]. This was later 
extended to base-b normality (for base-b versions of the 
Champernowne constant) as in Theorem 5.1. In 1946, 
Copeland and Erddés established that the corresponding 
concatenation of primes 0.23571113171923... and the con- 
catenation of composites 0.46891012141516..., among 
others, are also 10-normal [26]. In general they proved that 
concatenation leads to normality if the sequence grows slowly 
enough. We call such numbers concatenation numbers: 


THEOREM 5.1 ((26)). If a, ap, ... isan increasing sequence 
of integers such that for every @ <1 the number of a,’s up 
to N exceeds N’ provided N is sufficiently large, then the 
infinite decimal 


0.d1d2a3 °°: 


is normal with respect to the base b in which these integers 
are expressed. 


This result clearly applies to the Champernowne numbers 
(Fig. 7(d)), to the primes of the form ak + c with a and ¢ rel- 
atively prime, in any given base, and to the integers that are the 
sum of two squares (since every prime of the form 4k + 1 is 
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Figure 10. A walk on the first 100,000 bits of the primes 
(CE(2)) base two (normal). 


included). In further illustration, using the primes in binary 
leads to normality in base 2 of the number 


CE(2) =0.101110111110111101100011001110111111011 
1111100101101001101011...2, 


shown as a planar walk in Figure 10. 


5.1.1 Strong Normality 

In [14] it is shown that Cj fails the following stronger test of 
normality, which we now discuss. The test is is a simple one, in 
the spirit of Borel’s test of normality, as opposed to other more 
statistical tests discussed in [14]. If the digits of a real number « 
are chosen at random in the base b, the asymptotic frequency 
m,{n)/n of each 1-string approaches 1/b with probability 1. 
However, the discrepancy m,(n) — n/b does not approach 
any limit, but fluctuates with an expected value equal to the 
standard deviation \/(b—1)n/b. (Precisely m,(7) := 
#{i: a; = k,i <n} when «& has fractional part 0.d9a,d2°+- in 
base b.) 

Kolmogorov’s law of the iterated logarithm allows one to 
make a precise statement about the discrepancy of a random 
number. Belshaw and P. Borwein [14] use this to define their 
criterion and then to show that almost every number is 
absolutely strongly normal. 


DEFINITION 5.2 (Strong normality [14]). For real «, and 
m,(n)as above, ais simply strongly normal in the base bif for 
each 0 < k < b— 1one has 


»(2) — n/b 

lim sup ic A a = 
nc  /2nloglogn b (10) 
lim inf ua 
n-00 J2n loglogn b 


A number is strongly normal in base b if it is simply 
strongly normal in each base b,j = 1,2,3,..., and is abso- 
lutely strongly normal if it is strongly normal in every base. 


In paraphrase (absolutely) strongly normal numbers are 
those that distributionally oscillate as much as is possible. 

Belshaw and Borwein show that strongly normal numbers 
are indeed normal. They also make the important observation 
that Champernowne’s base-b number is not strongly normal in 
base b. Indeed, there are b"' digits of length v and they all start 
with a digit between 1 and b — 1 whereas the following 


v — 1 digits take values between 0 and b — 1 equally. In 
consequence, there is a dearth of zeroes. This is easiest to 
analyze in base 2. As illustrated below, the concatenated 
numbers start 


1,10, 11, 100, 101, 110, 111, 1000, 1001, 1010, 1011, 
1100, 1101, 1110, 1111 


For v = 3 there are 4 zeroes and 8 ones, for v = 4 there are 
12 zeroes and 20 ones, and for v = 5 there are 32 zeroes 
and 48 ones. 

Because the details were not provided in [14], we present 
them here. 


THEOREM 5.3 (Belshaw and P. Borwein) Champer- 
nowne’s base-2 number is is not 2-strongly normal. 


PRoorF. In general, let mp: = 1 + (R — 1)2* for k > 1. One 
has mo(m,) = 1 + (k — 12" and so 


m, (Np) — mo(Np) = Ne — 2mo(np) = 2" — 1. 


In fact m,(72) > mo(7) for all 1. To see this, suppose it true 
for 1 < nz, and proceed by induction on &. Let us arrange 
the digits of the integers 2*,2*+1,...,2*+2* 1-1 ina 
2*-! by k + 1 matrix, where the i-th row contains the digits 
of the integer 2 + i — 1. Each row begins 10, and if we 
delete the first two columns we obtain a matrix in which 
the 7th row is given by the digits of 7 — 1, possibly pre- 
ceded by some zeroes. Neglecting the first row and the 
initial zeroes in each subsequent row, we see the first 7,_1 
digits of Champernowne’s base-2 number, where by our 
induction hypothesis (7) > mp (7) for n < np_. 

If we now count all the zeroes as we read the matrix in the 
natural order, any excess of zeroes must come from the initial 


zeroes, and there are exactly 2*-! — 1 of these. As shown 
above, 12,(7,) — mo(n,) = 2® — 1, so m,(n) > mo(n) + 
2°! for every n < np + (k + 1) 2°71. A similar argument for 
the integers from 2” 4+ 2"! to 2&1 1. shows that 
m(n) > mn) for every m < Mp,4. Therefore, 2m,(1) > 
mn) + mn) = n forall n, and so 


lim inf = OTe >0 
n—co ,/2nloglogn 


and, as asserted, Champernowne’s base-2 number is not 
2-strongly normal. 


t seems likely that by appropriately shuffling the integers, 
one should be able to display a strongly normal variant. Along 
this line, Martin [40] has shown how to construct an explicit 
absolutely nonnormal number. 

Finally, although the log log limiting behavior required by 
(10) appears difficult to test numerically to any significant 
level, it appears reasonably easy computationally to check 
whether other sequences, such as many of the concatenation 
sequences of Theorem 5.1, fail to be strongly normal for 
similar reasons. 

Heuristically, we would expect the number CA(2) above to 


fail to be strongly normal, because each prime of length k both 
starts and ends with a one, whereas intermediate bits should 
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0.0 0.2 0.4 0.6 0.8 1.0 
le9 


(a) CE(2) (not strongly 2-normal ?). 


Plot of the first 10° values of 2@="/2_ 


Figure II. ~/iniealeat 
show no skewing. Indeed, for CE(2) we have checked that 
2m(n) >n for all n < 10°, see also Figure 11(a). Thus 
motivated, we are currently developing tests for strong nor- 
mality of numbers such as CE(2) and «23 below in binary. 
For 023, the corresponding computation of the first 10? 


values of ae leads to the plot in Figure 11(b) and leads 


us to conjecture that it is 2-strongly normal. 


5.2 Stoneham Numbers: A Class Containing Provably 
Normal and Nonnormal Constants 

Giving further motivation for these studies is the recent pro- 

vision of rigorous proofs of normality for the Stonebam 

numbers, which are defined by 


1 
Ape = y one”? 


m>1 


(11) 
for relatively prime integers b, c [10]. 


THEOREM 5.4 (Normality of Stoneham constants [3]). For 
every coprime pair of integers (b, c) with b > 2 and c > 2, 
the constant op = Soma We” 0") is b-normal. 


So, for example, the constant #23 = >°,s, 1/(3*23") = 
0.0418836808315030... is provably 2-normal. This special 
case was proven by Stoneham in 1973 [43]. More recently, 
Bailey and Misiurewicz were able to establish this normality 
result by a much simpler argument, based on techniques of 
ergodic theory [11] [16, pg. 141-173]. 

Equally interesting is the following result: 


THEOREM 5.5 (Nonnormality of Stoneham constants [3]). 
Given coprime integers b > 2 and c > 2, and integers p, 
qd, > 1, with neitber b nor c dividing r , let B= ? cr. 
Assume that the condition D = c¥?r'/?/b! <1 is satisfied. 
Then the constant 2p,¢ = rps 1/(ckb“) is Benonnormal. 


In various of the Figures and Tables, we explore the striking 


differences of behavior—proven and unproven—for a. as 
we vary the base. For instance, the nonnormality of #23 in 
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(b) a2,3 (strongly 2-normal ?). 


Figure 12. Zooming in on the base-4 walk on a3 of 
Figure 7(c) and Conjecture 5.6. 


base-6 digits was proved just before we started to draw walks. 
Contrast Figure 7(b) to Figure 7(c) and Figure 7(a). Now 
compare the values presented in Table 1 and Table 2. Clearly, 
from this sort of visual and numeric data, the discovery of other 
cases of Theorem 5.5 is very easy. 

As illustrated also in the “zoom” of Figure 12, we can use 
these images to discover more subtle structure. We conjecture 
the following relations on the digits of #23 in base 4 (which 
explain the values in Tables 1 and 2): 


CoNJECTURE 5.6 (Base-4 structure of 23). Denote 
by a, the R” digit of 23 in its base-4 expansion; that 
is, 2,3 = 7, ae/4*, with ap € {0,1,2,3} for all k. Then, 
forall n = 0,1,2,... one has: 


3(3"41)+3" ; 
en) . eter /2 (-1)""1-1 t "1, 1, nodd : 
7 2 a 1, neven’ 
k=3(3"41) , 


GD dp = Gpi3 = Apy2.3" forallk 
oo 33" +1) +3" 1. 


3(3” 4 


In Figure 13, we show the position of the walk after 
33" + 1),3(3" +1) +3" and 3(3”+1)+2-3”" steps for 
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Figure 13. A pattern in the digits of «23 base 4. We show 
only positions of the walk after 3(3” + 1),3(3” + 1) +3” and 
333" +1) +2-3” steps for n= 0,1,..., 11. 


n=0,1,...,11, which, together with Figures 7(c) and 12, 
graphically explain Conjecture 5.6. Similar results seem to 
hold for other Stoneham constants in other bases. For 
instance, for a3. base 3 we conjecture the following. 


CONJECTURE 5.7 (Base-3 structure of o3,). Denote by 
dp the RK” digit of 035 in its base-3 expansion; that is, 
03.5 = 0, ae/3", with ap € {0,1,2} for all k. Then, for 
all n = 0,1,2,... one has: 


245" 44.5" 


ey eter /2 = (-1)" (=) = e3n+2)ni/3, 

Raps 

Gi) ap = Gpyasn = Apigsn = Apyi2sy = Apyiose fork = 
srt 7 = 2,...,2+4-5". 


Along this line, Bailey and Crandall showed that, given a 
real number rin [0,1), and 7, denoting the k-th binary digit of 7, 
the real number 


oo 
1 
02,3(7) = ae (12) 
k=0 
is 2-normal. It can be seen that if r#s, then 


d23(7) # &3(s), so that these constants are all distinct. 
Thus, this generalized class of Stoneham constants is un- 
countably infinite. A similar result applies if 2 and 3 in this 
formula are replaced by any pair of co-prime integers 
(b, c) greater than 1, [10] [16, pg. 141-173]. We have not yet 
studied this generalized class by graphical methods. 


5.3 The Erdos-Borwein Constants 

The constructions of the previous two subsections exhaust 

most of what is known for concrete irrational numbers. By 

contrast, we finish this section with a truly tantalizing case: 
In a base b > 2, we define the Erdés—(Peter) Borwein 

constant EB(b) by the Lambert series [18]: 


EB(b) : 3 pee) (13) 


n>1 n>1 


where 1(72) is the number of divisors of 7. It is known that 
the numbers 5° ,,~, 1/(b” — 1”) are irrational for ra nonzero 
rational and b = 2,3,... such that r 4 b” for all 7 [20]. 


Whence, as provably irrational numbers other than the 
standard examples are few and far between, it is interesting 
to consider their normality. 

Crandall [27] has observed that the structure of (13) is 
analogous to the “BBP” formula for z (see [7, 16]) and used 
this, as well as some nontrivial knowledge of the arithmetic 
properties of t, to establish results such as that the googol-th 
bit (namely, the bit in position 10’ to the right of the “deci- 
mal” point) of EB(2) is a 1. 

In [27] Crandall also computed the first 2° bits (one Tbyte) 
of EB(2), which required roughly 24 hours of computation, 
and found that there are 4359105565638 zeroes and 
4436987450570 ones. There is a corresponding variation in the 
second and third place in the single-digit hex (base-16) dis- 
tributions. This certainly leaves some doubt as to its normality. 
Likewise, Crandall finds that in the first 1,000 decimal positions 
after the quintillionth digit 10"), the respective digit counts for 
digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9are 104, 82, 87, 100, 73, 126, 
87, 123, 114, 104. Our own more modest computations of 
EB(0) base-10 again leave it far from clear that EB(10) is 
10-normal. See also Figure 7(e) but contrast it to Figure 8(f). 

We should note that for computational purposes, we 
employed the identity 


1 e+ 1 
3 br —1 ee br —1 pr?’ 
n>1 


n>1 


for |b| > 1, due to Clausen, as did Crandall [27]. 


6 Other Avenues and Concluding Remarks 

Let us recall two further examples used in [14], that of Q(7), the 
Liouville function, which counts the parity of the number of 
prime factors of 7 (see Figure 14), and the human genome 
taken from the UCSC Genome Browser at http://hgdownload. 
cse.ucsc.edu/goldenPath/hg19/chromosomes/ (see Fig. 15). 
Note the similarity of the genome walk to the those of con- 
catenation sequences. We have explored a wide variety of 
walks on genomes, but we will reserve the results for a future 
study. 

We should emphasize that, to the best of our knowledge, 
the normality and transcendence status of the numbers 
explored is unresolved other than in the cases indicated in 
sections 5.1 and 5.2 and indicated in Appendix 7. Although 
one of the clearly nonrandom numbers (say Stoneham or 
Copeland-Erdés) may pass muster on one or other measure of 
the walk, it is generally the case that it fails another. Thus, the 
Liouville number Az (see Fig. 14) exhibits a much more 
structured drift than z or e, but looks more like them than like 
Figure 15(a). 

This situation provides hope for more precise future anal- 
yses. We conclude by remarking on some unresolved issues 
and plans for future research. 


6.1 Fractal and Box-Dimension 

Another approach is to estimate the fractal dimensions of 
walks, which is an appropriate tool with which to measure the 
geometrical complexity ofa set, characterizing its space-filling 
capacity (see, e.g., [6] for a nice introduction about fractals). 
The box-counting dimension, also known as the Minkowski- 
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(a) Directions used: >, t, <, J. 


(b) Directions used: 7, \,, \, Z. 


Figure 14. Two different rules for plotting a base-2 walk on the first two million values of A(m) (the Liouville number 22). 


(a) Human X. 


me 


Figure 15. Base-4 walks on 10° bases of the X-chromosome and 10° digits of log 2. 


Bouligand dimension, permits us to estimate the fractal 
dimension of a given set and often coincides with the fractal 
dimension. The box-dimension of the walk of numbers such 
as 7 turns out to be close to 2, whereas for nonrandom num- 
bers as 2,3 in base 6 or Champernowne’s number, the box- 
dimension is nearly 1. 


6.2 Three Dimensions 

We have also explored three-dimensional graphics—using 
base-6 for directions—both in perspective and in a large 
passive (glasses-free) three-dimensional viewer outside the 
CARMA laboratory; but we have not yet quantified these 
excursions. 


6.3 Genome Comparison 

Genomes are made up of so-called purine and pyrimidine 
nucleotides. In DNA, purine nucleotide bases are adenine and 
guanine (A and G), whereas the pyrimidine bases are thymine 
and cytosine (T and C). Thymine is replaced by uracyl in RNA. 
The haploid human genome (i.e., 23 chromosomes) is 


estimated to hold about 3.2 billion base pairs and so to contain 
20,000-25,000 distinct genes. Hence there are many ways of 
representing a stretch of a chromosome as a walk, say as a 
base-4 uniform walk on the symbols (A, G, T, C) illustrated in 
Figure 15 (where A, G, T, and C draw the new point to the 
south, north, west, and east, respectively, and we have not 
plotted undecoded or unused portions), or as a three- 
dimensional logarithmic walk inside a tetrahedron. 

We have also compared random chaos games in a square 
with genomes and numbers plotted by the same rules.” As an 
illustration, we show twelve games in Figure 16: four on a 
triangle, four ona square, and four ona hexagon. At each step 
we go from the current point halfway toward one of the ver- 
tices, chosen depending on the value of the digit. The color 
indicates the number of hits, in a similar manner as in Figure 6. 
The nonrandom behavior of the Champernowne numbers is 
apparent in the coloring patterns, as are the special features of 
the Stoneham numbers described in Section 5.2 (the non- 
normality of «2,3 and «3,9 in base 6 yields a paler color, whereas 
the repeating structure of #2 3 and a3 ; is the origin of the purple 
tone, see Conjectures 5.6 and 5.7). 


*The idea of a chaos game was described by Barnsley in his 1988 book Fractals Everywhere [6]. Games on amino acids seem to originate with [35]. For a recent summary see 


[17, pp. 194-2085]. 
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Figure 16. Chaos games on various numbers, colored by frequency. Row 1: C3, 03,5, a (pseudo)random number, and «23. Row 2: 


21. WALKING ON REAL NUMBERS (2013) 


C4, 7, a (pseudo)random number, and #3. Row 3: Cg, %3,2, a (pseudo)random number, and «2 3. 


(a) A thousand digits of the Thue— 
Morse sequence 7 M2 base 2. 


Figure 17. Walks on two automatic and nonnormal numbers. 


6.4 Automatic Numbers 

We have also explored numbers originating with finite state 
automata, such as those of the paper-folding and the Thue- 
Morse sequences, P and TMg, see [2] and Section 7. Automatic 
numbers are never normal and are typically transcendental; by 
comparison, the Liouville number /2 is not p-automatic for any 
prime p [25]. 

The walks on P and TM), have a similar shape, see 
Figure 17, but while the Thue—Morse sequence generates very 
large pictures, the paper-folding sequence generates very 
small ones, because it is highly self-replicating; see also the 
values in Tables 1 and 2. 

A turtle plot on these constants, where each binary digit 
corresponds to either “forward motion” of length 1 or 


fo 


(b) Ten million digits of the paper- 
folding sequence base 2. 


“rotate the Logo turtle” in a fixed angle, exhibits some of 
their striking features (see Fig. 18). For instance, drawn 
with a rotating angle of 2/3,7M2 converges to a Koch 
snowflake [41]; see Figure 18(c). We show a corresponding 
turtle graphic of m in Figure 18(d). Analogous features 
occur for the paper-folding sequence as described in [28, 
29, 30], and two variants are shown in Figures 18(a) and 
18(b). 


6.5 Continued Fractions 

Simple continued fractions often encode more information 
than base expansions about a real number. Basic facts are that 
a continued fraction terminates or repeats if and only if the 
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(a) Ten million digits of the paper- 
folding sequence with rotating angle 
a/3. 


(¢) Koch snowflake from 100,000 digits 
of the Thue—Morse sequence 7 M2 with 
rotating angle 7/3. 


(b) Dragon curve from one million dig- 
its of the paper-folding sequence with 
rotating angle 27/3. 


(d) One million digits of 7 with rotat- 
ing angle 7/3. 


Figure 18. Turtle plots on various constants with different rotating angles in base 2—where “0” gives forward motion and “1” 


rotation by a fixed angle. 


number is rational or a quadratic irrational, respectively; see 
(16, 7]. By contrast, the simple continued fractions for z and 
e start as follows in the standard compact form: 


n =([3,7, 15, 1, 292, 1, 1,1, 2,1, 3,1, 14, 2,1, 
1, 2, 2, 2, 2, 1,84, 2, 1,1, 15,3, 13, 1,4,...] 

e =[2,1,2,1,1,4,1,1,6,1,1,8,1,1,10,1,1,12,1,1,14,1, 
1,16, 1,1,18,1,1,20,1,..,], 


from which the surprising regularity of e and apparent 
irregularity of m as continued fractions is apparent. The 
counterpart to Borel’s theorem—that almost all numbers 
are normal—is that almost all numbers have “normal” 


continued fractions % = [41,d2,...,4n,---], for which the 
Gauss-Kuzmin distribution holds [16]: for each 
k=1,2,3,... 


1 
Prob{a,, = k} = — log, (: (e+ =): (14) 


56 = THE MATHEMATICAL INTELLIGENCER 


so that roughly 41.5% of the terms are 1, 16.99% are 2, 
9.31% are 3, etc. 

In Figure 19, we show a histogram of the first 100 million 
terms, computed by Neil Bickford and accessible at http:// 
neilbickford.com/picf-htm, of the continued fraction of 2. 
We have not yet found a satisfactory way to embed this in 
a walk ona continued fraction, but in Figure 20 we show base- 
4 walks on m and e where we use the remainder modulo 4 to 
build the walk (with 0 being right, 1 being up, 2 being left, and 
3 being down). We also show turtle plots on 7, e. 

Andrew Mattingly has observed that: 


PROPOSITION 6.1 With probability 1, a modA random 
walk (with 0 being right, 1 being up, 2 being left, and 3 being 
down) on the simple continued fraction coefficients of a real 
number is asymptotic to a line making a positive angle with 
the x-axis of: 


arctan G log,(1/2) — 1 
2log,(2/2) — 2log,(T(3/4)) 


) ~ 110.44°. 


358 


4.5 ner 


12 3 4 5 6 7 8 9 10 11 12 13 14 15 
(a) Histogram of the terms in green, Gauss— 
Kuzmin function in red. 
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(b) Difference between the expected and com- 
puted values of the Gauss—Kuzmin function. 


Figure 19. Expected values of the Gauss—Kuzmin distribution of (14) and the values of 100 million terms of the continued fraction 


of x. 


(a) A 100,000-step walk on the continued fraction of 7 


modulo 4. 


(c) A one million-step turtle walk 
on the continued fraction of 7 mod- 
ulo 2 with rotating angle 7/3. 


LPL PLP LPL 


(b) A 100-step walk on the continued frac- 
tion of e modulo 4. 


(d) A 100-step turtle walk on the 
continued fraction of e modulo 2 
with rotating angle 7/3. 


Figure 20. Uniform walks on a and e based on continued fractions. 


Proor. The result comes by summing the expected 
Gauss—Kuzmin probabilities of each step being taken as 
given by (14). 


This is illustrated in Figure 20(a) with a 90° anticlockwise 
rotation; thus making the case that one must have some 
a priori knowledge before designing tools. 

It is also instructive to compare digits and continued frac- 
tions of numbers as horizontal matrix plots of the form already 


used in Figure 8. In Figure 21, we show six pairs of million- 
term digit-strings and the corresponding continued fraction. 
In some cases both look random, in others one or the other 
does. 

In conclusion, we have only scratched the surface of what is 
becoming possible in a period in which data—for example, 
five-hundred million terms of the continued fraction or five- 
trillion binary digits of 7, full genomes, and much more—can 
be downloaded from the Internet, then rendered and visually 
mined, with fair rapidity. 
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Figure 21. Million-step comparisons of base-4 digits and continued fractions. Row 1: 02,3 (base 6) and Cy. Row 2: e and 2. Row 3: 
Q, and pseudorandom iterates; as listed from top left to bottom right. 


7 Appendix Selected Numerical Constants 

In what follows, x := 0.d\da2a3a4..., denotes the base- 
b expansion of the number «x, so that x = ei dpb-*. Base- 
10 expansions are denoted without a subscript. 


Catalan’s constant (irrational?; normal?): 


oo (-1)" 
G:= e+ 12 = 0.9159655941... (15) 
k=0 


Champernowne numbers (irrational; normal to correspond- 
ing base): 


mb- k[m—(b!-1)] 


m=! are 1 
=e bet m(b— 1b a 


Cio = 0.123456789101112... 
C4 = 0.1231011121320212223. . .4 


Copeland-Erdés_ constants (irrational; normal to 


corresponding base): 


= Saab (Hoe {log,2m|) 
hI a7) 


where pg is the R" prime number 


CE(10) = 0.2357111317... 
CE(2) = 0.1011101111...2 
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Exponential constant (transcendental; normal?): 


44 
e:= SP = 2.7182818284... (18) 


Erd6s—Borwein constants (irrational; normal?): 
foe} 
1 
st re (19) 
<—t bk 1 


EB(2) = 1.6066951524... = 1.212311001...4 


Euler—Mascheroni constant (irrational?,; normal?): 


k=1 


y= lim (355- Hem) = 0.5772156649... (20) 


Fibonacci constant (irrational [12, Theorem 2]; normal?): 


F = Fel 
k= 

ey -(e4) 
Ve 

= 0.011235813213455. .. 


—(1eE YT losio Fl) where 
, 


a 


Liouville number (irrational; not p-automatic): 


dg = ye) 2* (22) 
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where A(R) := (—1)% and Q(k) counts prime factors of k 
= 0.5811623188... = 0.10010100110. . .2 


Logarithmic constant (transcendental; normal?): 


ae 
log 2 := — 2 
og Dak (23) 
= 1 
= 0.6931471806... = 0.10110001011100100001. . .2 
Pi (transcendental; normal?): 
af V1 — x?dx ay (-1)" (24) 
T= 
a 2k+1 


k=0 


= 3.1415926535... = 11.00100100001111110110. . 2 


Riemann zeta function at integer arguments (transcen- 
dental for 7 even; irrational for 2 = 3; unknown for n > 5 
odd; normal?): 


1 
ts) = og (35 
=1 
In particular: 
2 
(2) = = = 1.64493406068... 
(2m) = (122g, 


2(2n)! ~ 
(where B2,, are Bernoulli numbers) 
5 es) (aie 
¢(3) = Apery’s constant = 224 Ba) 
= k 
= 1.2020569031... 
Stoneham constants (irrational; normal in some bases; non- 


normal in different bases; normality still is unknown other 
bases): 


(26) 


Sd 


k=1 


at2,3 = 0.0418836808. .. = 0.0022232032. . .4 
= 0.0130140430003334. . .¢ 

04,3 = 0.0052087571... = 0.0001111111301. . .4 
= 0.0010430041343502130000. . «5 

03,2 = 0.0586610287. .. = 0.0011202021212121...3 
= 0.0204005200030544000002. . . 

43,5 = 0.0008230452. .. = 0.00000012101210121. ..3 
= 0.002ba00000061d2. . .15 


Thue-—Morse constant (transcendental; 2-automatic, hence 
nonnormal): 


eevee 
TM? := ys Rn) 


where ¢(0) = 0, while ¢(27) = t(n) 


and ¢#(2n + 1) =1-<(n) (27) 


= 0.4124540336... 
= 0.01101001100101101001011001101001. . .2 


Paper-folding constant (transcendental; 2-automatic, hence 
nonnormal): 


oo 92" 
P:= =p = 0.8507361882. . . 
> aT 3 


= 0.1101100111001001.. .2 (28) 
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and computation of pi to ten trillion digits,” Advances in Di erence Equations, 
2013:100, p. 1-59. 


Synopsis: 

This paper presents one of the most complete and up-to-date chronologies of the 
analysis and computation of 7 through the ages, from approximations used by In- 
dian and Babylonian mathematicians, well before the time of Christ, to Archimedes 
of Syracuse, “who ranks with Newton and Gauss as one of the three greatest mathe- 
maticians who ever lived,” to mathematicians in the Islamic world during the “dark 
ages,” and on to mathematicians in Renaissance Europe, including Francois Viete, 
Ludolph van Ceulen, John Wallis, Isaac Newton, John Machin, Leonard Euler, 
William Shanks and many others. 

In the twentieth century, the chronology continues with summaries of the work 
of Ramanujan, then computer-age calculations beginning with Wrench and Smith, 
Guilloud and Fillatre, and, in more recent years, the work of Gosper, Brent, Salamin 
and others who either development new mathematics related to a or performed 
prodigious computer-based calculations of 7,, right up to and including the com- 
putation of 7 to ten trillion digits by Kondo and Yee, which, as of this writing, 
is almost the state-of-the-art (Kondo and Yee have subsequently computed twelve 
trillion digits). 

The paper includes many formula and techniques used, all of which is spelled 
out in considerable detail. 


Keywords: Algorithms, Computation, Curiosities, History, Series 
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Abstract 


The universal real constant pi, the ratio of the circumference of any circle and its 
diameter, has no exact numerical representation in a finite number of digits in any 
number/radix system. It has conjured up tremendous interest in mathematicians and 
non-mathematicians alike, who spent countless hours over millennia to explore its 
beauty and varied applications in science and engineering. The article attempts to 
record the pi exploration over centuries including its successive computation to ever 
increasing number of digits and its remarkable usages, the list of which is not yet 
closed. 


Keywords: circle; error-free; history of pi; Matlab; random sequence; stability of a 
computer; trillion digits 


All circles have the same shape, and traditionally represent the infinite, immeasurable and 
even spiritual world. Some circles may be large and some small, but their ‘circleness, their 
perfect roundness, is immediately evident. Mathematicians say that all circles are similar. 
Before dismissing this as an utterly trivial observation, we note by way of contrast that not 
all triangles have the same shape, nor all rectangles, nor all people. We can easily imag- 
ine tall narrow rectangles or tall narrow people, but a tall narrow circle is not a circle at 
all. Behind this unexciting observation, however, lies a profound fact of mathematics: that 
the ratio of circumference to diameter is the same for one circle as for another. Whether 
the circle is gigantic, with large circumference and large diameter, or minute, with tiny 
circumference and tiny diameter, the relative size of circumference to diameter will be ex- 
actly the same. In fact, the ratio of the circumference to the diameter of a circle produces, 
the most famous/studied/unlimited praised/intriguing/ubiquitous/external/mysterious 
mathematical number known to the human race. It is written as pi or as 2 [1-210], sym- 


bolically, and defined as 


. distance around a circle ic 
i 


= = =. 
distance across and through the center of the circle D 
Since the exact date of birth of z is unknown, one could imagine that 2 existed before 
the universe came into being and will exist after the universe is gone. Its appearance in 
the disks of the Moon and the Sun, makes it as one of the most ancient numbers known 
to humanity. It keeps on popping up inside as well as outside the scientific community, 
for example, in many formulas in geometry and trigonometry, physics, complex analysis, 
cosmology, number theory, general relativity, navigation, genetic engineering, statistics, 
© 2013 Agarwal et al.; licensee Springer. This is an Open Access article distributed under the terms of the Creative Commons 


Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction 
in any medium, provided the original work is properly cited. 
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fractals, thermodynamics, mechanics, and electromagnetism. Pi hides in the rainbow, and 
sits in the pupil of the eye, and when a raindrop falls into water 7 emerges in the spreading 
rings. Pi can be found in waves and ripples and spectra of all kinds and, therefore, 7 occurs 
in colors and music. The double helix of DNA revolves around 7. Pi has lately turned up in 
super-strings, the hypothetical loops of energy vibrating inside subatomic particles. Pi has 
been used as a symbol for mathematical societies and mathematics in general, and built 
into calculators and programming languages. Pi is represented in the mosaic outside the 
mathematics building at the Technische Universitat Berlin. Pi is also engraved on a mosaic 
at Delft University. Even a movie has been named after it. Pi is the secret code in Alfred 
Hitchcock's ‘Torn Curtain’ and in “The Net’ starring Sandra Bullock. Pi day is celebrated 
on March 14 (which was chosen because it resembles 3.14). The official celebration be- 
gins at 1:59 p.m., to make an appropriate 3.14159 when combined with the date. In 2009, 
the United States House of Representatives supported the designation of Pi Day. Albert 
Einstein was born on Pi Day (14 March 1879). 

Throughout the history of z, which according to Beckmann (1971) ‘is a quaint little mir- 
ror of the history of man; and James Glaisher (1848-1928) ‘has engaged the attention of 
many mathematicians and calculators from the time of Archimedes to the present day, 
and has been computed from so many different formula, that a complete account of its 
calculation would almost amount to a history of mathematics, one of the enduring chal- 
lenges for mathematicians has been to understand the nature of the number z (ratio- 
nal/irrational/transcendental), and to find its exact/approximate value. The quest, in fact, 
started during the pre-historic era and continues to the present day of supercomputers. 
The constant search by many including the greatest mathematical thinkers that the world 
produced, continues for new formulas/bounds based on geometry/algebra/analysis, re- 
lationship among them, relationship with other numbers such as z = 5cos!(o/2), 7X 
4/,/¢, where ¢ is the Golden section (ratio), and e” +1 = 0, which is due to Euler and 
contains 5 of the most important mathematical constants, and their merit in terms of 
computation of digits of 7. Right from the beginning until modern times, attempts were 
made to exactly fix the value of 7, but always failed, although hundreds constructed cir- 
cle squares and claimed the success. These amateur mathematicians have been called the 
sufferers of morbus cyclometricus, the circle-squaring disease. Stories of these contribu- 
tors are amusing and at times almost unbelievable. Many came close, some went to tens, 
hundreds, thousands, millions, billions, and now up to ten trillion (101°) decimal places, 
but there is no exact solution. The American philosopher and psychologist William James 
(1842-1910) wrote in 1909 ‘the thousandth decimal of Pi sleeps there though no one may 
ever try to compute it! Thanks to the twentieth and twenty-first century, mathematicians 
and computer scientists, it sleeps no more. In 1889, Hermann Schubert (1848-1911), a 
Hamburg mathematics professor, said ‘there is no practical or scientific value in knowing 
more than the 17 decimal places used in the foregoing, already somewhat artificial, appli- 
cation, and according to Arndt and Haenel (2000), just 39 decimal places would be enough 
to compute the circumference of a circle surrounding the known universe to within the 
radius of a hydrogen atom. Further, an expansion of z to only 47 decimal places would 
be sufficiently precise to inscribe a circle around the visible universe that does not deviate 
from perfect circularity by more than the distance across a single proton. The question 
has been repeatedly asked why so many digits? Perhaps the primary motivation for these 
computations is the human desire to break records; the extensive calculations involved 
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have been used to test supercomputers and high-precision multiplication algorithms (a 
stress test for a computer, a kind of ‘digital cardiogram’), the statistical distribution of the 
digits, which is expected to be uniform, that is, the frequency with which the digits (0 to 
9) appear in the result will tend to the same limit (1/10) as the number of decimal places 
increases beyond all bounds, and in recent years these digits are being used in applied 
problems as a random sequence. It appears experts in the field of z are looking for sur- 
prises in the digits of z. In fact, the Chudnovsky brothers once said: ‘We are looking for the 
appearance of some rules that will distinguish the digits of z from other numbers. If you 
see a Russian sentence that extends for a whole page, with hardly a comma, it is definitely 
Tolstoy. If someone gave you a million digits from somewhere in z, could you tell it was 
from x’? Some interesting observations are: The first 144 digits of 2 add up to 666 (which 
many scholars say is ‘the mark of the Beast’); Since there are 360 degrees in a circle, some 
mathematicians were delighted to discover that the number 360 is at the 359th digit posi- 
tion of z. A mysterious 2008 crop circle in Britain shows a coded image representing the 
first 10 digits of 2. The Website ‘The Pi-Search Page’ finds a person’s birthday and other 
well-known numbers in the digits of 2. Several people have endeavored to memorize the 
value of m with increasing precision, leading to records of over 100,000 digits. 

We believe that the study and discoveries of 2 will never end; there will be books, re- 
search articles, new record-setting calculations of the digits, clubs and computer programs 
dedicated to z. In what follows, we shall discuss the growth and the computation of z 
chronologically. For our ready reference, we also give some digits of 7, 


W = 3.1415926535897932384.6264338327950288419716939937510 


582097494.45923078164.062862089986280348253421170679. 


About 3200 BC. The meaning of the word sulv is to measure, and geometry in ancient 
India came to be known by the name sulba or sulva. Sulbasutras means ‘rule of chords, 
which is another name for geometry. The Sulbasutras are part of the larger corpus of texts 
called the Shrautasutras, considered to be appendices to the Vedas, which give rules for 
constructing altars. If the ritual sacrifice was to be successful, then the altar had to conform 
to very precise measurements, so mathematical accuracy was seen to be of the utmost 
importance. The sulbas contain a large number of geometric constructions for squares, 
rectangles, parallelograms and trapezia. Sulbas also contain remarkable approximations 


1 1 1 
22°1+-+——->——_, 
v2 3. 3-4 3-4-34 


which gives /2 = 1.4142156..., and 


6 2 
mt & 18(3 — 2/2) = (5a) ; 
which gives 7 = 3.088311.... 

About 2742 BC. Aryabhatta was born in 2765 BC in Patliputra in Magadha, modern 
Patna in Bihar (India). He was teaching astronomy and mathematics when he was 23 years 
of age in 2742 BC. His astronomical knowledge was so advanced that he could claim that 
the Earth rotated on its own axis, the Earth moves round the Sun and the Moon rotates 
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round the Earth; incredibly he believed that the orbits of the planets are ellipses. He talks 
about the position of the planets in relation to its movement around the Sun. He refers to 
the light of the planets and the Moon as reflection from the Sun. He explains the eclipse 
of the Moon and the Sun, day and night, the length of the year exactly as 365 days. He 
calculated the circumference of the Earth as 24,835 miles, which is close to modern day 
calculation of 24,900 miles. In his Aryabhattiyam, which consists of the 108 verses and 
13 introductory verses, and is divided into four padas or chapters (written in the very 
terse style typical of sutra literature, in which each line is an aid to memory for a com- 
plex system), Aryabhatta included 33 verses giving 66 mathematical rules ganita on pure 
mathematics. He described various original ways to perform different mathematical op- 
erations, including square and cube roots and solving quadratic equations. He provided 
elegant results for the summation of series of squares and cubes. He made use of dec- 
imals, the zero (sunya) and the place value system. To find an approximate value of 7, 
Aryabhatta gives the following prescription: Add 4 to 100, multiply by 8 and add to 62,000. 
This is ‘approximately’ the circumference of a circle whose diameter is 20,000. This means 
= 62,832/20,000 = 3.1416. It is important to note that Aryabhatta used the word asanna 
(approaching), to mean that not only is this an approximation of z, but that the value is 
incommensurable or irrational, i.e., it cannot be expressed as a ratio of two integers. 

About 2600 BC. Great pyramid at Gizeh was built around 2600 BC in Egypt. It is one 
of the most massive buildings ever erected. It has at least twice the volume and thirty times 
the mass (the resistance an object offers to a change in its speed or direction of motion) 
of the Empire Sate Building in New York, and built from individual stones weighing up to 
70 tons each. From the dimensions of the Great Pyramid, it is possible to derive the value 
of 7, namely, z = half the perimeter of the base of the pyramid, divided by its height = 3 + 
1/7 ~ 3.14285.... 

About 2000 BC. In a tablet found in 1936 in Susa (Iraq), Babylonians used the value 


3. 57 36 


x 60 (60)2’ 


which yields z = 3 1/8 = 3.125. They were also satisfied with z = 3. 

About 2000 BC. Ahmes (around 1680-1620 BC) (more accurately Ahmose) was an 
Egyptian scribe. A surviving work of Ahmes is part of the Rhind Mathematical Papyrus, 
1650 BC (named after the Scottish Egyptologist Alexander Henry Rhind who went to 
Thebes for health reasons, became interested in excavating and purchased the papyrus 
in Egypt in 1858) located in the British Museum since 1863. When new, this papyrus was 
about 18 feet long and 13 inches high. Ahmes states that he copied the papyrus from a now- 
lost Middle Kingdom original, dating around 2000 BC. This curious document entitled 
directions for knowing all dark things, deciphered by Eisenlohr in 1877, is a collection of 
problems in geometry and arithmetic, algebra, weights and measures, business and recre- 
ational diversions. The 87 problems are presented with solutions, but often with no hint as 
to how the solution was obtained. In problem no. 50, Ahmes states that a circular field with 
a diameter of 9 units in area is the same as a square with sides of 8 units, i.e., (9/2)? = 8?, 
and hence the Egyptian value of z is 


8\2 
mw=4x (5) = 3.16049..., 
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which is only very slightly worse than the Babylonians value, and in contrast to the latter, an 
overestimation. We have no idea how this very satisfactory result was obtained (probably 
empirically), although various justifications are available. Maya value of 7 was as good as 
that of the Egyptians. 

About 1200 BC. The earliest Chinese mathematicians, from the time of Chou-Kong 
used the approximation z = 3. Some of those who used this approximation were mathe- 
maticians of considerable attainments in other respects. According to the Chinese mythol- 
ogy, 3 is used because it is the number of the Heavens and the circle. 

About 950 BC. In the Old Testament (I Kings vii.23, and 2 Chronicles iv.2), we find 
the following verse: ‘Also, he made a molten sea of ten cubits from brim to brim, round in 
compass, and five cubits the height thereof; and a line of thirty cubits did compass it round 
about: Hence the biblical value of z is 30/10 = 3. The Jewish Talmud, which is essentially a 
commentary on the Old Testament, was published about 500 AD. This shows that the Jews 
did not pay much attention to geometry. However, debates have raged on for centuries 
about this verse. According to some, it was just a simple approximation, while others say 
that ‘...the diameter perhaps was measured from outside, while the circumference was 
measured from inside’ 

About 900 BC. Shatapatha Brahmana (Priest manual of 100 paths) is one of the prose 
texts describing the Vedic ritual. It survives in two recensions, Madhyandina and Kanva, 
with the former having the eponymous 100 brahmanas in 14 books, and the latter 104 
brahmanas in 17 books. In these books, z is approximated by 339/108 = 3.138888.... 

About 440 BC. Anaxagoras of Clazomanae (500-428 BC) came to Athens from near 
Smyrna, where he taught the results of the Ionian philosophy. He neglected his possessions 
in order to devote himself to science, and in reply to the question, what was the object of 
being born, he remarked: ‘The investigation of the Sun, Moon and heaven’ He was the 
first to explain that the Moon shines due to reflected light from the Sun, which explains 
the Moons phases. He also said that the Moon had mountains and he believed that it was 
inhabited. Anaxagoras gave some scientific accounts of eclipses, meteors, rainbows, and 
the Sun, which he asserted was larger than the Peloponnesus: this opinion, and various 
other physical phenomena, which he tried to explain which were supposed to have been 
direct action of the Gods, led him to a prosecution for impiety. While in prison he wrote a 
treatise on the quadrature of the circle. (The general problem of squaring a figure came to 
be known as the quadrature problem.) Since that time, hundreds of mathematicians tried 
to find a way to draw a square with equal area to a given circle; some maintained that they 
have found methods to solve the problem, while others argued that it is impossible. We 
will see that the problem was finally laid to rest in the nineteenth century. 

About 430 BC. Hippocrates of Chios was born about 470 BC, and began life as a mer- 
chant. About 430 BC he came to Athens from Chios and opened a school of geometry, 
and began teaching, thus became one of the few individuals ever to enter the teaching pro- 
fession for its financial rewards. He established the formula zr? for the area of a circle in 
terms of its radius. It means that a certain number z exists, and is the same for all circles, 
although his method does not give the actual numerical value of z. In trying to square 
the circle (unsuccessfully), Hippocrates discovered that two moon-shaped figures (lunes, 
bounded by pair of circular arcs) could be drawn whose areas were together equal to that 
of a right-angled triangle. Hippocrates gave the first example of constructing a rectilinear 
area equal to an area bounded by one or more curves. 
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About 430 BC. Antiphon of Rhamnos (around 480-411 BC) was a sophist who at- 
tempted to find the area of a circle by considering it as the limit of an inscribed regu- 
lar polygon with an infinite number of sides. Thus, he provided preliminary concept of 
infinitesimal calculus. 

About 420 BC. Bryson of Heraclea was born around 450 BC. He was a student of 
Socrates. Bryson considered the circle squaring problem by comparing the circle to poly- 
gons inscribed within it. He wrongly assumed that the area of a circle was the arithmetical 
mean between circumscribed and inscribed polygons. 

About 420 BC. Hippias of Elis was born about 460 BC. He was a Greek Sophist, a 
younger contemporary of Socrates. He is described as an expert arithmetician, but he is 
best known to us through his invention of a curve called the quadratrix (x = ycot(zy/2)), 
by means of which an angle can be trisected, or indeed divided in any given ratio. It is not 
known whether Hippias realized that by means of his curve the circle could be squared; 
perhaps he realized but could not prove it. He lectured widely on mathematics and as 
well on poetry, grammar, history, politics, archeology and astronomy. Hippias was also a 
prolific writer, producing elegies, tragedies and technical treatises in prose. His work on 
Homer was considered excellent. 

414 BC. Aristophanes (446-386 BC) in his play The Birds makes fun of circle squarers. 

Around 375 BC. Plato of Athens (around 427-347 BC) was one of the greatest Greek 
philosophers, mathematicians, mechanician, a pupil of Socrates for eight years, and 
teacher of Aristotle. He is famous for ‘Plato’s Academy’ ‘Let no man ignorant of math- 
ematics enter here’ is supposed to have been inscribed over the doors of the Academy. He 
is supposedly obtained for his day a fairly accurate value for 1 = /2 + /3 = 3.146.... 

About 370 BC. Eudoxus of Cnidus (around 400-347 BC) was the most celebrated math- 
ematician. He developed the theory of proportion, partly to place the doctrine of in- 
commensurables (irrationals) upon a thoroughly sound basis. Specially, he showed that 
the area of a circle is proportional to its diameter squared. Eudoxus established fully the 
method of exhaustions of Antiphon by considering both the inscribed and circumscribed 
polygons. He also considered certain curves other than the circle. He explained the ap- 
parent motions of the planets as seen from the earth. Eudoxus also wrote a treatise on 
practical astronomy, in which he supposed a number of moving spheres to which the Sun, 
Moon and stars were attached, and which by their rotation produced the effects observed. 
In all, he required 27 spheres. 

About 350 BC. Dinostratus (around 390-320 BC) was a Greek mathematician. He used 
Hippias quadratrix to square the circle. For this, he proved Dinostratus’ theorem. Hippias 
quadratrix later became known as the Dinostratus quadratrix also. However, his demon- 
stration was not accepted by the Greeks as it violated the foundational principles of their 
mathematics, namely, using only ruler and compass. 

About 240 BC. Archimedes of Syracuse (287-212 BC) ranks with Newton and Gauss 
as one of the three greatest mathematicians who ever lived, and he is certainly the 
greatest mathematician of antiquity. Galileo called him ‘divine Archimedes, superhu- 
man Archimedes’; Sir William Rowan Hamilton (1805-1865) remarked ‘who would not 
rather have the fame of Archimedes than that of his conqueror Marcellus’?; Alfred North 
Whitehead (1861-1947) commented ‘no Roman ever died in contemplation over a geomet- 
rical diagram’; Godfrey Harold Hardy (1877-1947) said ‘Archimedes will be remembered 


when Aeschylus is forgotten, because languages die and mathematical ideas do not’; and 
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Figure 1 Circle with diameter 1. 


Voltaire remarked ‘there was more imagination in the head of Archimedes than in that of 
Homer: His mathematical work is so modern in spirit and technique that it is barely dis- 
tinguishable from that of a seventeenth-century mathematician. Among his mathemati- 
cal achievements, Archimedes developed a general method of exhaustion for finding ar- 
eas bounded by parabolas and spirals, and volumes of cylinders, parabolas, segments of 
spheres, and specially to approximate 7, which he called as the parameter to diameter. His 
approach to approximate z is based on the following fact: the circumference ofa circle lies 
between the perimeters of the inscribed and circumscribed regular polygons (equilateral 
and equiangular) of 1 sides, and as n increases, the deviation of the circumference from 
the two perimeters becomes smaller. Because of this fact, many mathematicians claim that 
it is more correct to say that a circle has an infinite number of corners than to view a circle 
as being cornerless. If a,, and b,, denote the perimeters of the inscribed and circumscribed 
regular polygons of x sides, and C the circumference of the circle, then it is clear that {a,} 
is an increasing sequence bounded above by C, and {b,} is a decreasing sequence bounded 
below by C. Both of these sequences converge to the same limit C. To simplify matters, 
suppose we choose a circle with the diameter 1, then from Figure 1 it immediately follows 
that 


. Us 
ad,=nsin— and b,=ntan—. (1) 
n n 


It is clear that limy—.90 4, = 1 = limy-+. b,. Further, bz, is the harmonic mean of a, and 
b,, and a2, is the geometric mean of a, and byy, i.e., 


2anby, 
bay = and day = Vanbon- (2) 


From (1) for the hexagon, i.e., n = 6 it follows that ae = 3, be = 2./3. Then Archimedes 
successively took polygons of sides 12, 24, 48 and 96, used the recursive relations (2), and 
the inequality 


265 1,351 
pany 3 : 
153 <V3< 780 


which he probably found by what is now called Heron’s method, to obtain the bounds 


10 1 
ea | oes = 3.142857.... (3) 
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It is interesting to note that during Archimedes time algebraic and trigonometric no- 
tations, and our present decimal system were not available, and hence he had to de- 
rive recurrence relations (2) geometrically, and certainly for him the computation of do¢ 
and bog must have been a formidable task. The approximation 22/7 is often called the 
Archimedean value of 1, and it is good for most purposes. If we take the average of the 
bounds given in (3), we obtain m = 3.141851.... The above method of computing a by 
using regular inscribed and circumscribed polygons is known as the classical method of 
computing z. It follows that an inscribed regular polygon of 2” sides takes up more than 
1-1/2"" of the area of a circle. Heron of Alexandria (about 75 AD) in his Metrica, which 
had been lost for centuries until a fragment was discovered in 1894, followed by a complete 
copy in 1896, refers to an Archimedes work, where he gives the bounds 

_ 211,875 197,888 


= << = 3.173774.... 
67,441 62,351 


3.14163... 


Clearly, in the above right inequality, there is a mistake as it is worse than the upper bound 
22/7 found by Archimedes earlier. Heron adds ‘Since these numbers are inconvenient 
for measurements, they are reduced to the ratio of the smaller numbers, namely, 22/7: 
Archimedes’ polygonal method remained unsurpassed for 18 centuries. Archimedes also 
showed that a curve discovered by Conon of Samos (around 280-220 BC) could, like Hip- 
pias’ quadratrix, be used to square the circle. The curve is today called the Archimedean 
Spiral. 

About 123 BC. Daivajna Varahamihira (working 123 BC) was an astronomer, math- 
ematician and astrologer. His picture may be found in the Indian Parliament along 
with Aryabhata. He was one of the nine jewels (Navaratnas) of the court of legendary 
king Vikramaditya I (101-18 BC). In 123 BC, Varahamihira wrote Pancha-Siddhanta 
(The Five Astronomical Canons), in which he codified the five existing Siddhantas, 
namely, Paulisa Siddhanta, Romaka Siddhanta, Vasishtha Siddhanta, Surya Siddhanta and 
Paitamaha Siddhanta. He also made some important mathematical discoveries such as 
giving certain trigonometric formulae; developing new interpolation methods to produce 
sine tables; constructing a table for the binomial coefficients; and examining the pandiag- 
onal magic square of order four. In his work, he approximated z as /10. 

15 BC. Marcus Vitruvius Pollio (about 80-5 BC), a Roman writer, architect and engineer, 
in his multi-volume work De Architectura (On Architecture) used the value z = 3 1/8 = 
3.125, which is the same as Babylonians had used 2,000 years earlier. He was the first to 
describe direct measurement of distances by the revolution of a wheel. 

About 10 BC. Liu Xin (Liu Hsin) (about 50 BC-23 AD) was an astronomer, historian and 
editor during the Xin Dynasty (9-23 AD). Liu created a new astronomical system, called 
Triple Concordance. He was the first to give a more accurate calculation of m as 3.1547, 
the exact method he used to reach this figure is unknown. This was first mentioned in the 
Sui shu (387-388). He also found the approximations 3.1590, 3.1497 and 3.1679. 

Around 5 AD. Liu Xin (50 BC-AD 23) was a Chinese astronomer, historian and editor 
during the Xin Dynasty (9-23 AD). He was the son of Confucian scholar Liu Xiang (77-6 
BC). Liu created a catalog of 1,080 stars, where he used the scale of 6 magnitudes. He was 
the first in China to give a more accurate calculation of z as 3.1457. The method he used 


to reach this figure is unknown. 
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10 AD. Brahmagupta (born 30 BC) wrote two treatises on mathematics and astron- 
omy: the Brahmasphutasiddhanta (The Correctly Established Doctrine of Brahma) but 
often translated as (The Opening of the Universe), and the Khandakhadyaka (Edible Bite) 
which mostly expands the work of Aryabhata. As a mathematician he is considered as 
the father of arithmetic, algebra, and numerical analysis. Most importantly, in Brahmas- 
phutasiddhanta he treated zero as a number in its own right, stated rules for arithmetic 
on negative numbers and zero, and attempted to define division by zero, particularly he 
wrongly believed that 0/0 was equal to 0. He used a geometric construction for squaring 
the circle, which amounts to x = s/10. 

125. Zhang Heng (78-139 AD) was an astronomer, mathematician, inventor, geogra- 
pher, cartographer, artist, poet, statesman and literary scholar. He proposed a theory of 
the universe that compared it to an egg. ‘The sky is like a hen’s egg and is as round as a 
crossbow pellet. The Earth is like the yolk of the egg, lying alone at the center. The sky is 
large and the Earth is small: According to him the universe originated from chaos. He said 
that the Sun, Moon and planets were on the inside of the sphere and moved at different 
rates. He demonstrated that the Moon did not have independent light, but that it merely 
reflected the light from the sun. He is most famous in the West for his rotating celestial 
globe, and inventing in 132 the first seismograph for measuring earthquakes. He proposed 
/10 (about 3.1623) for 2. He also compared the celestial circle to the width (i.e., diameter) 
of the earth in the proportion of 736 to 232, which gives z as 3.1724. 

150. Claudius Ptolemaeus (around 90-168 AD) known in English as Ptolemy, was a 
mathematician, geographer, astrologer, poet of a single epigram in the Greek Anthology, 
and most importantly astronomer. He made a map of the ancient world in which he em- 
ployed a coordinate system very similar to the latitude and longitude of today. One of his 
most important achievements was his geometric calculations of semichords. Ptolemy in 
his famous Syntaxis mathematica (more popularly known by its Arabian title of the Al- 
magest), the greatest ancient Greek work on astronomy, obtained, using chords of a circle 
and an inscribed 360-gon, an approximate value of in sexagesimal notation, as 3 8’ 30”, 
which is the same as 377/120 = 3.141666.... Eutocius of Ascalon (about 480-540) refers 
to a book Quick delivery by Apollonius of Perga (around 262-200 BC), who earned the 
title ‘The Great Geometer; in which Apollonius obtained an approximation for 2, which 
was better than known to Archimedes, perhaps the same as 377/120. 

250. Wang Fan (228-266) was a mathematician and astronomer. He calculated the dis- 
tance from the Sun to the Earth, but his geometric model was not correct. He has been 
credited with the rational approximation 142/45 for z, yielding 7 = 3.155. 

263. Liu Hui (around 220-280) wrote two works. The first one was an extremely im- 
portant commentary on the Jiuzhang suanshu, more commonly called Nine Chapters on 
the Mathematical Art, which came into being in the Eastern Han Dynasty, and believed to 
have been originally written around 1000 BC. (It should be noted that very little is known 
about the mathematics of ancient China. In 213 BC, the emperor Shi Huang of the Chin 
dynasty had all of the manuscript of the kingdom burned.) The other was a much shorter 
work called Haidao suanjing or Sea Island Mathematical Manual. In Jiuzhang suanshu, 
Liu Hui used a variation of the Archimedean inscribed regular polygon with 192 sides to 
approximate z as 3.141014 and suggested 157/50 = 3.14 as a practical approximation. 

About 330. Pappus of Alexandria (around 290-350) was born in Alexandria, Egypt, and 


either he was a Greek or a Hellenized Egyptian. The written records suggest that, Pappus 
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lived in Alexandria during the reign of Diocletian (284-305). His major work is Synagoge 
or the Mathematical Collection, which is a compendium of mathematics of which eight 
volumes have survived. Pappus’ Book IV contains various theorems on circles, study of 
various curves, and an account of the three classical problems of antiquity (the squaring 
of the circle, the duplication of a cube, and the trisection of an angle). For squaring the 
circle, he used Dinostratus quadratrix and his proof is a reductio ad absurdum. Pappus is 
remembered for Pappus’s centroid theorem, Pappus’s chain, Pappus’s harmonic theorem, 
Pappus’s hexagon theorem, Pappus’s trisection method, and for the focus and directrix of 
an ellipse. 

400. He Chengtian (370-447) gave the approximate value of m as 111,035/35,329 = 
3.142885.... 

475. Tsu Ch’ung-chih (Zu Chongzhi) (429-500) created various formulas that have 
been used throughout history. With his son he used a variation of Archimedes method 
to find 3.1415926 < m < 3.1415927. He also obtained a remarkable rational approximation 
355/113, which yields z correct to six decimal digits. In Chinese this fraction is known as 
Mili. To compute this accuracy for 7, he must have taken an inscribed regular 6 x 2!?- 
gon and performed lengthy calculations. Note that 2 = 355/113 can be obtained from the 


values of Ptolemy and Archimedes: 


355 377-22 
113. «120-7 ° 


He declared that 22/7 is an inaccurate value whereas 355/113 is the accurate value of z. 
We also note that 7 = 355/113 can be obtained from the values of Liu Hui and Archimedes. 


In fact, by using the method of averaging, we have 


157+(9x 22) 355 
50+(9x7) 113° 


486. Bhaskara II or Bhaskaracharya (working 486) wrote Siddhanta Siromani (crown 
of treatises), which consists of four parts, namely, Leelavati Bijaganitam, Grahaganitam 
and Goladhyaya. The first two exclusively deal with mathematics and the last two with 
astronomy. His popular text Leelavati was written in 486 AD in the name of his daugh- 
ter. His contributions to mathematics include: a proof of the Pythagorean theorem, solu- 
tions of quadratic, cubic, and quartic indeterminate equations, solutions of indeterminate 
quadratic equations, integer solutions of linear and quadratic indeterminate equations, 
a cyclic Chakravala method for solving indeterminate equations, solutions of the Pell’s 
equation and solutions of Diophantine equations of the second order. He solved quadratic 
equations with more than one unknown, and found negative and irrational solutions, pro- 
vided preliminary concept of infinitesimal calculus, along with notable contributions to- 
ward integral calculus, conceived differential calculus, after discovering the derivative and 
differential coefficient, stated Rolle’s theorem, calculated the derivatives of trigonometric 
functions and formulae and developed spherical trigonometry. He conceived the modern 
mathematical convention that when a finite number is divided by zero, the result is in- 
finity. He speculated the nature of the number 1/0 by stating that it is ‘like the Infinite, 
Invariable God who suffers no change when old worlds are destroyed or new ones cre- 


ated, when innumerable species of creatures are born or as many perish’ He gave several 
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approximations for 2. According to him 3,927/1,250 is an accurate value, 22/7 is an in- 
accurate value, and /10 is for ordinary work. The first value may have been taken from 
Aryabhatta. This approximation has also been credited to Liu Hui and Zu Chongzhi. He 
also gave the value 754/240 = 3.1416, which is of uncertain origin; however, it is the same 
as that by Ptolemy. 

510. Anicius Manlius Severinus Boethius (around 475-526) introduced the public use 
of sun-dials, water-clocks, etc. His integrity and attempts to protect the provincials from 
the plunder of the public officials brought on him the hatred of the Court. King Theodoric 
sentenced him to death while absent from Rome, seized at Ticinum (now Pavia), and in 
the baptistery of the church there tortured by drawing a cord round his head till the eyes 
were forced out of the sockets, and finally beaten to death with clubs on October 23, 526. 
His Geometry consists of the enunciations (only) of the first book of Euclid, and of a few 
selected propositions in the third and fourth books, but with numerous practical appli- 
cations to finding areas, etc. According to him, the circle had been squared in the period 
since Aristotle’s time, but noted that the proof was too long. 

800. Abu Jafar Mohammed Ibn Musa al-Khwarizmi (around 780-850) ‘Mohammed the 
father of Jafar and the son of Musa’ was a scholar in the academy Bait al-Hikma (House 
of Wisdom) founded by Caliph al-Mamun (786-833). His task (along with several other 
scholars) was to translate the Greek and Sanskrit scientific manuscripts. They also stud- 
ied, and wrote on algebra, geometry and astronomy. There al-Khwarizmi encountered the 
Hindu place-value system based on the numerals 0,1, 2,3, 4,5, 6,7, 8,9, including the first 
use of zero as a place holder in positional base notation, and he wrote a treatise around 
820 AD, on what we call Hindu-Arabic numerals. The Arabic text is lost but a Latin trans- 
lation, Algoritmi de numero Indorum (that is, al-Khwarizmi on the Hindu Art of Reckon- 
ing), a name given to the work by Baldassarre Boncompagni in 1857, much changed from 
al-Khwarizmi’s original text (of which even the title is unknown) is known. The French 
Minorite friar Alexander de Villa Dei, who taught in Paris around 1240, mentions the name 
of an Indian king named Algor as the inventor of the new ‘art; which itself is called the 
algorismus. Thus, the word ‘algorithm’ was tortuously derived from al-Khwarizmi (Alch- 
warizmi, al-Karismi, Algoritmi, Algorismi, Algorithm), and has remained in use to this 
day in the sense of an arithmetic operation. This Latin translation was crucial in the in- 
troduction of Hindu-Arabic numerals to medieval Europe. Al-Khwarizmi used m = 22/7 
in algebra, z = /10 in geometry, and z = 62,832/20,000 = 3.1416 in astronomy. 

850. Mahavira (817-875) in his work Ganita Sara Samgraha summarized and extended 
the works of Aryabhatta, Bhaskara, Brahmagupta and Bhaskaracharya. This treatise con- 
tains: a naming scheme for numbers from 10 up to 104, formulas for obtaining cubes of 
sums; techniques for least common denominators (LCM), techniques for combinations 
"C, = n(n —1)(n -2)---(n—r + 1)/r!, techniques for solving linear, quadratic as well higher 
order equations, arithmetic and geometric series, and techniques for calculating areas and 
volumes. He was the first person to mention that no real square roots of negative num- 
bers can exist. According to Mahavira whatever is there in all the three worlds, which are 
possessed of moving and non-moving beings, all that indeed cannot exist without math- 
ematics. He used the approximate value of 2 as 10. He also mentions that the approxi- 
mate volume of a sphere with diameter d is (9/2)(d/2)°, i.e., 2 = 3.375, and exact volume 
is (9/10)(9/2)(d/2)°, i.e., 7 = 3.0375. 
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About 1040. Franco von Liittich (around 1015-1083) claimed to have contributed the 
only important work in the Christian era on squaring the circle. His works are published 
in six books, but only preserved in fragments. 

1220. Fibonacci (Leonardo of Pisa) (around 1170-1250) after the Dark Ages is consid- 
ered the first to revive mathematics in Europe. He wrote Liber Abbaci (Book of the Aba- 
cus) in 1202. In this book, he quotes that ‘The nine Indian numerals are... with these 
nine and with the sign 0 which in Arabic is sifr, any desired number can be written’ His 
Practica geometria, a collection of useful theorems from geometry and (what would even- 
tually be named) trigonometry appeared in 1220, which was followed five years later by 
Liber quadratorum, a work on indeterminate analysis. A problem in Liber Abbaci led to 
the introduction of the Fibonacci sequence for which he is best remembered today; how- 
ever, this sequence earlier appeared in the works of Pingala (about 500 BC) and Virahanka 
(about 600 AD). In Practica geometriae, Fibonacci used a 96-sided polygon, to obtain the 
approximate value of m as 864/275 = 3.141818.... 

1260. Johannes Campanus (around 1220-1296) was chaplain to three popes, Pope Ur- 
ban IV, Pope Nicholas IV and Pope Boniface VIII He was one of the four greatest contem- 
porary mathematicians. Campanus wrote a Latin edition of Euclid’s Elements in 15 books 
around 1260. He used the value of z as 22/7. 

About 1300. Zhao Youqin (born 1271) used a regular polygon of 4 x 2” sides to derive 
mw =3.1415926. 

About 1360. Albert of Saxony (around 1320-1390) was a German philosopher known 
for his contributions to logic and physics. He wrote a long treatise De quadratura circuli 
(Question on the Squaring of the Circle) consisting mostly philosophy. He said ‘following 
the statement of many philosophers, the ratio of circumference to diameter is exactly 22/7; 
of this, there is proof, but a very difficult one’ 

1400. Madhava of Sangamagramma’s (134.0-1425) work has come to light only very re- 
cently. Although there is some evidence of mathematical activities in Kerala (India) prior 
to Madhava, e.g., the text Sadratnamala (about 1300), he is considered the founder of the 
Kerala school of astronomy and mathematics. Madhava was the first to have invented the 
ideas underlying infinite series expansions of functions, power series, trigonometric series 
of sine, cosine, tangent and arctangent, which is 


2n-1 
ee # x 
tan) x =« 4 eee (1) ob aes (4) 
3 5 7 2n-1 


This series is valid for -1 < x < 1, and also for x = 1. He also gave rational approximations 
of infinite series, tests of convergence of infinite series, estimate of an error term, early 
forms of differentiation and integration and the analysis of infinite continued fractions. 
He fully understood the limit nature of the infinite series. Madhava discovered the solu- 
tions of transcendental (transcends the power of algebra) equations by iteration, and found 
the approximation of transcendental numbers by continued fractions. He also gave many 
methods for calculating the circumference of a circle. The value of z correct to 13 decimal 
places is attributed to Madhava. However, the text Sadratnamala, usually considered as 
prior to Madhava, while some researchers have claimed that it was compiled by Madhava, 
gives the astonishingly accurate value of z correct to 17 decimal places. 

1429. Jemshid al-Kashi (around 1380-1429), astronomer royal to Ulugh Beg of Sam- 
arkand, wrote several important books Sullam al-sama (The Stairway of Heaven), 
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Mukhtasar dar ‘ilm-i hayat (Compendium of the Science of Astronomy), Khaqani Zij on 
astronomical tables, Risala dar sharh-i alat-i rasd (Treatise on the Explanation of Obser- 
vational Instruments), Nuzha al-hadaigq fi kayfiyya sana al-ala almusamma bi tabag al- 
manatig (The Method of Construction of the Instrument Called Plate of Heavens), Risala 
al-muhitiyya (Treatise on the Circumference), The Key to Arithmetic, and The Treatise on 
the Chord and Sine. In these works al-Kashi showed a great venality in numerical work. 
In 1424, he calculated z to 14 decimal places, and later in 1429 to 16 decimal places. For 
this, he used classical polygon method of 6 x 27” sides. 

1460. George Piirbach (1423-1461) whose real surname is unknown, was born in 
Piirbach, a town upon the confines of Bavaria and Austria. He studied under Nicholas 
de Cusa, and one of his most famous pupils is Regiomontanus. Piirbach wrote a work on 
planetary motions which was published in 1460; an arithmetic, published in 1511; and a 
table of eclipses, published in 1514. He calculated tables of sines for every minute of arc 
for a radius of 600,000 units. This table was published in 1541. He approximated z by the 
rational 62,832/20,000, which is exactly the same as given by Aryabhatta. 

1464. Nicholas of Cusa (1401-1464) is often referred to as Nicolaus Cusanus and 
Nicholas of Kues (Cusa was a Latin place-name for a city on the Mosel). He was a German 
cardinal of the Roman Catholic Church, a philosopher, jurist, mathematician and an as- 
tronomer. Most of his mathematical ideas can be found in his essays, De Docta Ignorantia 
(Of Learned Ignorance), De Visione Dei (Vision of God) and On Conjectures. He made 
important contributions to the field of mathematics by developing the concepts of the in- 
finitesimal and of relative motion. He gave the approximations of z as (3/4)(/3 + /6) 
and 24.,/21/35 = 3.142337.... Nicholas thought this to be the exact value. Nicholas said, 
if we can approach the Divine only through symbols, then it is most suitable that we use 
mathematical symbols, for these have an indestructible certainty. He also said that no per- 
fect circle can exist in the universe. In accordance with his wishes, his heart is within the 
chapel altar at the Cusanusstift in Kues. 

1464. Johann Regiomontanus (Johannes Miller) (1436-1476) is considered as one of 
the most prominent mathematicians of his generation. He was the first to study Greek 
mathematical works in order to make himself acquainted with the methods of reasoning 
and results used there. He also well read the works of the Arab mathematicians. In most 
of this study, he compiled in his De Triangulis, which was completed in 1464, however, 
was published only in 1533. Regiomontanus used algebra to find solutions of geometrical 
problems. He criticized Nicholas of Cusa’s approximations and methods to approximate 
the value of z and gave the approximation 3.14343. 

About 1500. Nilakanthan Somayaji’s (around 1444-1544) most notable work Tantra- 
sangraha elaborates and extends the contributions of Madhava. He was also the author 
of Aryabhatiya-Bhashya, a commentary of the Aryabhatiya. Of great significance in 
Nilakanthan’s work includes the inductive mathematical proofs, a derivation and proof of 
the arctangent trigonometric function, improvements and proofs of other infinite series 
expansions by Madhava, and in Sanskrit poetry the series 
1 


1 
4 Weis 5 
9 u m 


which follows from Madhava’s series (4) when x = 1. In the literature (5) is known as 
Gregory-Leibniz series. He also gave sophisticated explanations of the irrationality of 
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x, the correct formulation for the equation of the center of the planets, and a helio- 
centric model of the solar system. If s,, denotes the mth partial sum of (5), then s; = 1, 
519 = 0.76045..., S199 = 0.78289..., 51,000 = 0.78514... , 10,000 = 0.78537 ... and Roy North 
showed that 4s500,000 = 3.14159065358979324.0462643383269502884197 (where under- 
lined digits are incorrect) indicating an annoyingly slow convergence of the partial sums. 
Since this is an alternating series, the error committed by stopping at the mth term does 
not exceed 1/(27 + 1) in absolute value. Thus, to compute 7/4 to eight decimals from (5) 
would require n > 10° terms. Hence, although it is only of theoretical interest, the expres- 
sions on the right are arithmetical, while arises from geometry. We also note that the 


series (5) can be written as 


iT 1 1 1 
=1-2 + + tered. 
4 3-5 7-9 11-13 
The following expansion of z is also due to Nilakanthan 


4 4 4 4 


mw=3+ - + tees 
2:3-4 4-5-6 6-7-8 8-9-10 


This series converges faster than (5). 

Before 1510. Leonardo da Vinci (1452-1519) was an Italian painter, sculptor, architect, 
musician, scientist, mathematician, engineer, inventor, anatomist, geologist, cartographer, 
botanist and writer. He briefly worked on squaring the circle, or approximating 7. 

1525. Michael Stifel (1486-1567) served in several different Churches at different posi- 
tions; however, every time due to bad circumstances had to resign and flee. He made the 
error of predicting the end of the world on 3 October 1533, and other time used a clever 
rearrangement of the letters LEO DECIMVS to ‘prove’ that Leo X was 666, the number 
of the beast given in the Book of Revelation. He was forced to take refuge in a prison af- 
ter ruining the lives of many believing peasants who had abandoned work and property to 
accompany him to heaven. In the later part of his life, he lectured on mathematics and the- 
ology. He invented logarithms independently of Napier using a totally different approach. 
His most famous work is Arithmetica integra which was published in 1544. This work 
contains binomial coefficients, multiplication by juxtaposition, the term ‘exponent; and 
the notation +, — and ./, and the opinion that the quadrature of z is impossible. Ac- 
cording to him ‘the quadrature of the circle is obtained when the diagonal of the square 
contains 10 parts of which the diameter of the circle contains 8: Thus, 7 ~ 3 1/8. 

1525. Albrecht Diirer (1471-1528) was a famous artist and mathematician. His book 
Underweysung der Messung mit dem Zirckel und Richtscheyt provides measurement of 
lines, areas and solids by means of compass and ruler, particularly there is a discussion of 
squaring the circle. 

1544. Oronce Fine (1494-1555) was a prolific author of mathematical books. He was 
imprisoned in 1524, probably for practicing judicial astrology. He approximated z as 
3 11/63 = 3.174603.... Later, he gave 3 2/15 = 3.133333... and, in 1556, 3 11/78 = 
3.141025.... 

1559. Johannes Buteo (1492-1572), a French scholar published a book De quadratura 
circuli, which seems to be the first book that accounts the history of z and related prob- 


lems. 
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1573. Valentin Otho (around 1550-1603) was a German mathematician and astronomer. 
In 1573, he came to Wittenberg and proposed to Johannes Praetorius the Tsu Ch’ung-chih 
approximate value of m as 355/113. 

1580. Tycho Brahe was an astronomer and an alchemist and was known for his most 
accurate astronomical and planetary observations of his time. His data was used by his 
assistant, Kepler, to derive the laws of planetary motion. He observed a new star in 1572 
and a comet in 1577. In 1566, when he was just 20, he lost his nose partially in a duel with 
another student in Wittenberg and wore throughout his life a metal insert over his nose. 
His approximation to z is 88//785 = 3.140854... 

1583. Simon Duchesne finds m = (39/22)? = 3.142561.... 

About 1584. Zhu Zaiyu (1536-1611), a noted musician, mathematician and astronomer- 
calendarist, Prince of the Ming Dynasty, obtained the twelfth root of two. He also gave 
the approximate value of z as /2/0.45 = 3.142696.... Around the same time Xing Yunlu 
adopted z as 3.1126 and 3.12132034, while Chen Jinmo and Fang Yizhi, respectively, took 
as 3.1525 and 52/17. 

1584. Simon van der Eycke (Netherland) published an incorrect proof of the quadrature 
of the circle. He approximated 7 as 1,521/484 = 3.142561.... In 1585, he gave the value 
3.1416055. 

1585. Adriaen Anthoniszoon (1529-1609) was a mathematician and fortification engi- 
neer. He rediscovered the Tsu Ch’ung-chih approximation 355/113 to 2. This was appar- 
ently lucky incident, since all he showed was that 377/120 > m > 333/106. He then averaged 
the numerators and the denominators to obtain the ‘exact’ value of z. 

1593. Francois Viéte (1540-1603) is frequently called by his semi-Latin name of Vieta. 
In relation to the three famous problems of antiquity, he showed that the trisection of an 
angle and the duplication of a cube problems depend upon the solution of cubic equa- 
tions. He has been called the father of modern algebra and the foremost mathemati- 
cian of the sixteenth century. In his 1593 book, Supplementum geometriae, he showed 
3.1415926535 < m < 3.1415926537, i.e., gave the value of z correct to 9 places. For this, he 
used the classical polygon of 6 x 2° = 393,216 sides. He also represented z as an infinite 
product 


(6) 


ca 1 V2 2+ V2) y 2+¥ 2+ v2) 
eS 5 5 tik 


For this, we note that 


: x # x x 5. * - x\ i... # 

sinx = cos — - 2sin~ = cos —cos — -2*sin— =::-= ] [coos 2 sin | 
2 2 2 22 2? 2 2 
k=1 
and hence 

sinx (11 x sin x/2* 
—F= cos — | ———, 

x 2k J x /2k 

k=1 / 


which as k + oo, and then x = 7/2 gives 
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Finally, note that 


The above formula (6) is one of the milestones in the history of 7. The convergence of 
Vieta’s formula was proved by Ferdinand Rudio (1856-1929) in 1891. It is clear that Vieta’s 
formula cannot be used for the numerical computation of z. In fact, the square roots are 
much too cumbersome, and the convergence is rather slow. It is clear that if we define 
a = J1/2 and ay41 = (1. + a,)/2, then (6) is the same as 4243 +++ = 2/7. 

1593. Adrianus van Roomen (1561-1615), more commonly referred to as Adrianus 
Romanus, successively professor of medicine and mathematics in Louvain, professor of 
mathematics at Wiirzburg, and royal mathematician (astrologer) in Poland, proposed a 
challenge to all contemporary mathematicians, to solve a certain 45th degree equation. 
The Dutch ambassador presented van Roomen’s book to King Henry IV with the com- 
ment that at present there is no mathematician in France capable of solving this equation. 
The King summoned and showed the equation to Vieta, who immediately found one so- 
lution to the equation, and then the next day presented 22 more. However, negative roots 
escaped him. In return, Vieta challenged van Roomen to solve the problem of Apollonius, 
to construct a circle tangent to three given circles, but he was unable to obtain a solution 
using Euclidean geometry. When van Roomen was shown proposer’s elegant solution, he 
immediately traveled to France to meet Vieta, and a warm friendship developed. The same 
year Rooman used the classical method with 2”° sides, to approximate z to 15 correct dec- 
imal places. 

1594. Joseph Justus Scaliger (1540-1609) was a religious leader and scholar. He is known 
for ancient Greek, Roman, Persian, Babylonian, Jewish and Egyptian history. In his work, 
Cyclometrica elementa duo he claimed that 7 is equal to 10. 

1596. Ludolph van Ceulen (1539-1610) was a German who emigrated to the Nether- 
lands. He taught Fencing and Mathematics in Delft until 1594, when he moved to Leiden 
and opened a Fencing School. In 1600, he was appointed to the Engineering School at 
Leiden, where he spent the remainder of his life teaching Mathematics, Surveying and 
Fortification. He wrote several books, including Van den Circkel (On The Circle, 1596), 
in which he published his geometric findings, and the approximate value of z correct to 
20 decimal places. For this, he reports that he used classical method with 60 x 25, ie., 
515,396,075,520 sides. This book ends with ‘Whoever wants to, can come closer: 

1610. Ludolph van Ceulen (1539-1610) in his work De Arithmetische en Geometrische 
fondamenten, which was published posthumously by his wife in 1615, computed z correct 
to 35 decimal places by using classical method with 2 sides. This computational feat was 
considered so extraordinary that his widow had all 35 digits of die Ludolphsche Zahl (the 
Ludolphine number) was engraved on his tombstone in St. Peter’s churchyard in Leiden. 
The tombstone was later lost but was restored in 2000. This was one of the last major 
attempts to evaluate z by the classical method; thereafter, the techniques of calculus were 


employed. 
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1621. Willebrord Snell (Snellius) (1580-1626) was a Dutch astronomer and mathemati- 
cian. At the age of 12, he is said to have been acquainted with the standard mathematical 
works, while at the age of 22, he succeeded his father as Professor of Mathematics at Lei- 
den. His fame rests mainly on his discovery in 1621 of the law of refraction, which played a 
significant role in the development of both calculus and the wave theory of light. However, 
it is now known that this law was first discovered by Ibn Sahl (940-1000) in 984. Snell clev- 
erly combined Archimedean method with trigonometry, and showed that for each pair of 
bounds on z given by the classical method, considerably closer bounds can be obtained. 
By his method, he was able to approximate z to seven places by using just 96 sides, and 


239 sides. The classical 


to van Ceulen’s 35 decimal places by using polygons having only 
method with such polygons yields only two and fifteen decimal places. 

1627. Yoshida Mitsuyoshi (1598-1672) was working during Edo period. His 1627 work 
named as Jinkoki deals with the subject of soroban arithmetic, including square and cube 
root operations. In this work, he used 3.16 for z. 

1630. Christoph (Christophorus) Grienberger (1561-1636) was an Austrian Jesuit as- 
tronomer. The crater Gruemberger on the Moon is named after him. He used Snell's re- 
finement to compute z to 39 decimal places. This was the last major attempt to compute 
a by the Archimedes method. 

1635. Celiang quanyi (Complete Explanation of Methods of Planimetry and Stere- 
ometry) gives without proof the following bounds 3.14159265358979323846 < m < 
3.14159265358979323847, i.e., 7 correct to 19 digits. 

1647. William Oughtred (1575-1660), an English mathematician offered free mathe- 
matical tuition to pupils, which included even Wallis. His textbook, Clavis Mathematicae 
(The Key to Mathematics) on arithmetic published in 1631 was used by Wallis and Newton 
amongst others. In this work, he introduced the x symbol for multiplication, and the pro- 
portion sign (double colon ::). He designated the ratio of the circumference of a circle to 
its diameter by 7/5. His notation was used by Isaac Barrow (1630-1677) a few years later, 
and David Gregory (1659-1708). Before him, mathematicians described z in round-about 
ways such as ‘quantitas, in quam cum multipliectur diameter, proveniet circumferential, 
which means ‘the quantity which, when the diameter is multiplied by it, yields the circum- 
ference’ 

1647. Grégoire de Saint-Vincent (1584-1667), a Jesuit, was a mathematician who discov- 
ered that the area under the hyperbola (xy = k) is the same over [a,b] as over [c,d] when 
alb = c/d. This discovery played an important role in the development of the theory of log- 
arithms and an eventual recognition of the natural logarithm. In 1668, Nicolaus Mercator 
(Kauffmann) (1620-1687) wrote a treatise entitled Logarithmo-technica, and discovered 
the series 


1 1 1 
In(1 + x) =x — <x? + =x — xt +---; (7) 
2 3 4 


however, the same series was independently discovered earlier by Saint-Vincent. In his 
book, Opus geometricum quadraturae circuli et sectionum coni he proposed at least four 
methods of squaring the circle, but none of them were implemented. The fallacy in his 
quadrature was pointed out by Huygens. 

1650. René Descartes (1596-1650) was a thoughtful child who asked so many ques- 
tions that his father called him ‘my little philosopher’ In 1638, he published his Discourse 
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on Method, which contained important mathematical work, and three essays, Meteors, 
Dioptrics and Geometry, produced an immense sensation and his name became known 
throughout Europe. The rectangular coordinate system is credited to Descartes. He is re- 
garded as a genius of the first magnitude. He was one of the most important and influential 
thinkers in human history and is sometimes called the founder of modern philosophy. Af- 
ter his death, a novel geometric approach to approximate z was found in his papers. His 
method consisted of doubling the number of sides of regular polygons while keeping the 


perimeter constant. In modern terms, Descartes’ method can be summarized as 


z k oe 
a = lim 2*tan{ — }. 
k->00 2k 


If we let a, = 2* tan(z/25), k > 2, then in view of tan20 = 2tan@/(1 — tan? @), x, = l/ax 


satisfies the relation 


-2k-2 
Hs (Xks1 — HK) = 2 ; 


and hence 
1 2 —2k\ 1/2 
Kar = 5 (mE + (mE +2 yy), k= 2, xq = 1/4. 
The sequence {x;} generated by the above recurrence relation converges to 1/7. 
1650. John Wallis (1616-1703) in 1649 was appointed as Savilian professor of geome- 
try at the University of Oxford, which he continued for over 50 years until his death. He 


was the most influential English mathematician before Newton. In his most famous work, 
Arithmetica infinitorum, which he published in 1656, he established the formula 


m=2 


2 4 4 6 8 
ae Pace sean (8) 
1 3.5 5 9 


NI] 0 


2 6 

a 7 
This formula is a great milestone in the history of z. Like Viéte’s formula (6), Wallis had 
found z in the form of an infinite product, but he was the first in history whose infinite 
sequence involved only rational operations. In his Opera Mathematica I (1695), Wallis in- 
troduced the term continued fraction. He rejected as absurd the now usual idea of a nega- 
tive number as being less than nothing, but accepted the view that it is something greater 
than infinity, specially showed that —1 > oo. He had great ability to do mental calculations. 
He slept badly and often did mental calculations as he lay awake in his bed. On 22 De- 
cember 1669, he when in bed, occupied himself in finding the integral part of the square 
root of 3 x 10*°; and several hours afterward wrote down the result from memory. Two 
months later, he was challenged to extract the square root of a number of 53 digits; this he 
performed mentally, and a month later he dictated the answer which he had not meantime 
committed to writing. Wallis’ life was embittered by quarrels with his contemporaries in- 
cluding Huygens, Descartes, and the political philosopher Hobbes, which continued for 
over 20 years, ending only with Hobbes’ death. Hobbes called Arithmetica infinitorum ‘a 
scab of symbols, and claimed to have squared the circle. It seems that to some, individual's 


quarrels give strength, encouragement and mental satisfaction. To derive (8), we note that 
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/2.. : P 
= Ie sin” x dx satisfies the recurrence relation 


2m-1 2m-3 5 3 1a 
lom = ‘ toh he ange ed 
2m 2m —2 642 2 
and 
2m 2m —-2 6 4 2 
Jom = 


Im+1 2m-1 75 3° 


From these relations, a termwise division leads to 


1 2-4-6---2m 2 1 Lom 
2° \3-5---(Qm-1)) 2m+ 1 ama’ 


Now, it suffices to show that 


* Lom, 
lim 
mo Ton41 


aay (10) 


2m-1 


We know that for all x € (0, 7/2) the inequalities sin x > sin?” x > sin?”"*! x hold. Thus, 


an integration from 0 to 2/2 gives yy) > Lom > Lom+i, and hence 


Loma > Lom 


>1. (11) 


loms1 i Loms1 
Further, from (9), we have 


Loy-1 _ 2m+1 


’ 


Dont 2m 
thus, it follows that 


~— domi 4. 2m+1 
lim = lim = 
m+ Inn.) m>oo 2m 


(12) 


Finally, a combination of (11) and (12) immediately gives (10). If we define a, = 1—1/(2n)*, 


then (8) is equivalent to 4\4243 --- = 2/1. We also note that 


1 iT (<) 
=—+0O i 
4d2°**a, 2 n 


1650. William Brouncker, 2nd Viscount Brouncker (1620-1684) was one of the founders 


and the second President of the Royal Society. His mathematical contributions are: repro- 


duction of Brahmagupta’s solution of a certain indeterminate equation, calculations of the 
lengths of the parabola and cycloid, quadrature of the hyperbola which required approx- 


imation of the natural logarithm function by infinite series and the study of generalized 
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continued fractions. He undertook some calculations to verify formula (8), and showed 
that 3.141592653569...< 1 < 3.141592653696..., which is very satisfactory. He also con- 
verted Wallis’ result (8) into the continued fraction 


z= , (13) 


25 
2+ 
Dees 


Neither of the expressions (8), and (13); however, later has served for an extensive calcu- 
lation of z. 
Another continued fraction representation of 2 which follows from the series (5) is 


32 
Dituces 


5+ 


1654. Christiaan Huygens (1629-1695) is famous for his invention of the pendulum 
clock, which was a breakthrough in timekeeping. He formulated the second law of mo- 
tion of Newton in a quadratic form, and derived the now well-known formula for the 
centripetal force, exerted by an object describing a circular motion. Huygens was the 
first to derive the formula for the period of an ideal mathematical pendulum (with mass- 
less rod or cord), T = 27 Jelg. For the computation of , he gave the correct proof of 
Snell’s refinement, and using an inscribed polygon of only 60 sides obtained the bounds 
3.1415926533 < m < 3.1415926538, for the same accuracy the classical method requires 
almost 400,000 sides. 

1663. Muramatsu Shigekiyo (1608-1695) published Sanso, or Stack of Mathematics, in 
which he used classical polygon method of 2" sides to obtain z = 3.14195264877. 

1665. Sir Isaac Newton (1642-1727), hailed as one of the greatest scientist-mathemati- 
cians of the English-speaking world, had the following more modest view of his own mon- 
umental achievements: ‘... to myself I seem to have been only like a boy playing on the 
seashore, and diverting myself in now and then finding a smoother pebble or a prettier 
shell than ordinary, whilst the great ocean of truth lay all undiscovered before me’ As he 
examined these shells, he discovered to his amazement more and more of the intricacies 
and beauties that lay in them, which otherwise would remain locked to the outside world. 
At the age of 26, he succeeded Barrow as Lucasian professor of mathematics at Cambridge. 
About him, Aldous Huxley (1894-1963) had said ‘If we evolved a race of Isaac Newtons, 
that would not be progress. For the price Newton had to pay for being a supreme intellect 
was that he was incapable of friendship, love, fatherhood and many other desirable things. 
As aman he was a failure; as a monster he was superb: Newton made some of the greatest 
discoveries the world ever knew at that time. Newton discovered: 1. The nature of colors. 
2. The law of gravitation and the laws of mechanics. 3. The fluxional calculus. Most of the 
history books say that to compute 7 Newton used the series 


nj 1-2 6 1-3-x5 1-3-5-x7 
sin x =X + + + tee, 
2-3. 2-4. 2:4 


22. BIRTH, GROWTH AND COMPUTATION 


Agarwal et al. Advances in Difference Equations 2013, 2013:100 
http://www.advancesindifferenceequations.com/content/2013/1/100 


OF PI TO TEN TRILLION DIGITS (2013) 385 


Page 21 of 59 


Figure 2 Half-circle. D 


which for x = 1/2 gives 


nm. ofl 1 1 1-3 
—=sin’( —)= + + tere |; 
6 2 2 2-3-2) 2-4-5.25 


however, he actually used twenty-two terms to obtain 16 decimal places of the following 


series 


n= 33s 24( = : : : ~). (14) 


4 3-22 5-25 28-27 72-29 


Later, he wrote ‘Iam ashamed to tell you to how many figures I carried these computations, 
having no other business at the time: His result was not published until 1737 (posthu- 
mously). 

Using analysis and geometry, the series (14) can be obtained as follows: From Figure 2, 


the equation of the upper half circle is y = x”?(1 — x)”. Thus, binomial expansion gives 


1 1 1 5 7 
V2 _ = 3/2 _ 5/2 7/2 _ 9/2 
8 16 128 256 


yex ll 


Thus, the area of the sector ABD is (integrating the above series from 0 to 1/4) 


AABD = — : : (15) 
~ 3-22 5-25 28-27 72-29 ; 


Also, from geometry the area of the sector ABD is 


AABD = AABCD —- ABCD 


1/1_/1\’\ 1/1 1\’ f1\? x VB 
=—(-z|{- = ‘ (16) 
3\2 2 2\4 2 4 24 32 

Equating (15) and (16), we immediately get (14). 


1666. Thomas Hobbes of Malmesbury (1588-1679) was an English philosopher, best 
known today for his work on political philosophy. He also contributed in several other di- 


verse fields, including history, geometry, the physics of gases, theology, ethics and general 
philosophy. He approximated z by 3 1/5 = 3.2, which was refuted by Huygens and Wallis. 
In 1678, he also gave the approximation /10. 

1671. James Gregory (1638-1675) published two books Vera circuli et hyperbolae 
quadratura in 1667, and Geometriae pars universalis in 1668. In the first book partic- 
ularly, he showed that the area of a circle can be obtained in the form of an infinite con- 
vergent series only, and hence inferred that the quadrature of the circle was impossible. 
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In the second book, he attempted to write calculus systematically, which perhaps made 
the basis of Newton’s fluxions. This book also contains series expansions of sin(x), cos(x), 
arcsin(x) and arccos(x); however, as we have seen earlier these expansions were known 
to Madhava. Gregory anticipated Newton in discovering both the interpolation formula 
and the general binomial theorem as early as 1670. In early 1671, he discovered Taylor’s 
theorem (published by Taylor in 1715); however, he did not publish. Later in 1671, he redis- 
covered Nilakanthan’s arctangent series (5). In his Vera circuli et hyperbolae quadratura of 
1667, Gregory tried to show that 7 was a transcendental number, but his attempt, though 
very interesting, was not successful. Huygens made detailed and rather biased criticisms 
of it. 

1672. Pietro Mengoli (1626-1686) studied at the University of Bologna, and became a 
professor there in 1647 for the next 39 years of his life. Besides proposing Basel problem, 
he proved that the harmonic series does not converge, established that the alternating har- 
monic series is equal to the natural logarithm of 2, published on the problem of squaring 
the circle, and provided a proof that Wallis’ product (8) for z is correct. 

1674. Gottfried Wilhelm von Leibniz (1646-1716) was a universal genius who won recog- 
nition in many fields - law, philosophy, religion, literature, politics, geology, metaphysics, 
alchemy, history and mathematics. He shares credit with Newton in developing calculus 
independently. He popularized and gave several mathematical symbols. Leibniz tried to 
reunite the Protestant and Catholic churches. He in binary arithmetic saw the image of 
Creation. He imagined that Unity represented God, and Zero the void; that the Supreme 
Being drew all beings from the void, just as unity and zero express all numbers in the bi- 
nary system of numeration. He communicated his idea to the Jesuit Grimaldi, who was 
the President of the Chinese tribunal for mathematics in the hopes that it would help con- 
vert to Christianity the Emperor of China, who was said to be very fond of the Sciences. 
Later Leibniz became an expert in the Sanskrit language and the culture of China. For 
calculating 2, he developed a method without any reference to a circle. In 1674, he also 
rediscovered Nilakanthan’s arctangent series (5), whose beauty he described by saying that 
Lord loves odd numbers. Leibniz even invented a calculating machine that could perform 
the four operations and extract roots. 

1684. Isomura Yoshinori (1640-1710) employed a 2!’-sided inscribed polygon to obtain 
3.141592664 for z, but for some reason he wrote only z = 3.1416. 

1685. Father Adam Adamad Kochansky (1631-1700) was librarian of the Polish King 
John III. He was the first to utilize a steel spring for suspension of the pendulum of a 
clock. He used a new approximate geometric construction for z to obtain 


[40 
mx — 2/3 = 3.141533... 


His method was later quoted in several geometrical textbooks. 

1690. Takebe Katahiro (1664-1739) also known as Takebe Kenko played a critical 
role in the development of a crude version of the calculus. He also created charts for 
trigonometric functions. He used polygon (just 1,024 sides) approximation and a numer- 
ical method which is essentially equivalent to the Romberg algorithm (rediscovered by 
Sigmund Romberg, 1887-1951) to compute z to 41 digits. In 1722, Takebe obtained power 


series expansion of (sin? x)?,15 years earlier than Euler. Around 1729, essentially the same 
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series was rediscovered by Oyama Shokei who used it to find the expansion 


oo 
2" (n!)? 11 11-2 1 1-2-3 
n= 1+) aay" SOs ape sees access) fi, 
“ (2n +2)! 23 3 3-5 4 3-5-7 


The above expansion of x? was also given by Yamaji Nushizumi (1704-1772) around 1765. 
1699. Abraham Sharp (1653-1742) was a mathematician and astronomer. In 1688, he 
joined the Greenwich Royal Observatory and did notable work, improving instruments 


and showing great skill as a calculator. He also worked on geometry and improved log- 
arithmic tables. In the supervision of Edmund Halley (1656-1742), he realized that by 
putting « = 1 in (4) (see (5)) we lose the benefit of the powers x°,x°,x’,..., which tend 
to increase the rapidity of convergence for smaller values of x. He substituted x = 1/./3 in 
(4), to obtain 


as 1 1 a7) 
= + - free |, 
6 V3\ 3:3. 32-5 33-7 


Sharp used (17) to calculated z to 72 decimal places out of which 71 digits are correct. In 
(17), the 10th term is u(/3- 19 -3°), which is less than 0.00005, and hence we have at least 
4 places correct after just 9 terms. It is believed that Madhava of Sangamagramma used 
the same series in the fourteenth century to compute the value of z correct to 11 decimal 
places. 

1700. Seki Takakazu also known as Seki Kowa (1642-1708) is generally regarded as the 
greatest Japanese mathematician. He was a prolific writer, and a number of his publica- 
tions are either transcripts of mathematics from Chinese into Japanese, or commentaries 
on certain works of well-known Chinese mathematicians. His interests in mathematics 
ranged recreational mathematics, magic squares and magic circles, solutions of higher- 
order and indeterminate equations, conditions for the existence of positive and negative 
roots of polynomials, and continued fractions. He discovered determinants ten years be- 
fore Leibniz, and the Bernoulli numbers a year before Bernoulli. He used polygon of 2” 
sides and Richardson extrapolation (rediscovered by Alexander Craig Aitken, 1895-1967) 
to compute zr to 10 digits. Some authors believe that he also used the formula 


n 
4 
x= lim —~ > Vn? —j. 
noo W12 = ] 
= 


About 1700. Oliver de Serres believed that by weighing a circle and a triangle equal to 
the equilateral triangle inscribed he had found that the circle was exactly double of the 
triangle, not being aware that this double is exactly the hexagon inscribed in the same 
circle. Thus, according to him z = 3. 

1706. William Jones (1675-1749), an obscure English writer, represented the ratio of the 
circumference ofa circle to its diameter by z in his Synopsis Palmariorum Matheseos (New 
Introduction to the Mathematics). He used the letter z as an abbreviation for the Greek 
word perimetros (periphery) (of a circle with unit diameter). In his book, he published the 
value of z correct to 100 decimal places. 

1706. John Machin (1680-1752) was a professor of astronomy at Gresham College, Lon- 
don. He also served as secretary of the Royal Society during 1718-1747. Machin is best 
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remembered for computing the value of to 100 decimal places by using the formula 


1 1 
i= 4tan“!( = ) —tan-?( —— }, (18) 
4 5 239 


which in view of (4) is the same as 


1 A 1 1 1 1 1 1 (19) 
= + asia pe ane 
4 5 3-53 5.59 2393-2393 5.2395 


To establish (18), we let tan6 = 1/5, so that 


2tand 5 2 tan 20 120 
tan 20 = ————— = — and tan4@ = ———_— = —_. 
1-tan?6 12 1—tan?29 119 


Thus, it follows that 


il tan4@ -1 1 
tan| 40 = = 
4 1+tan49 239 


and hence 


-1 1 1 a ii 1 oa 
tan | —— ] =40-— =4tan’ (-]-—. 
239 4 5 4 


The proof of (18) also follows by comparing the angles in the identity (the idea originally 
goes back to Caspar Wessel (1745-1818) who presented his work in 1797 to the Royal Dan- 
ish Academy of Sciences) 


(5 + #)*(-239 + i) = -114,244(1 + 0), 


1 1 
aan(=) +70 - tan (55) =n +tan71, 


The series (19) certainly converges significantly faster than (5) and (17). In fact, taking six 
terms of the first series and two terms of the second and paying attention to the remainders 
and round-off errors, we get the inequalities 3.141592629 < m < 3.141592668. Thus, the 
value of z correct to seven decimals is 3.1415926. 

Several other Machin-type formulas are known, e.g., 


Tw afl afl 
— =2tan’{ —}-tan|{ - 
4 2 7 
1 1 1 
=5tan1( = ) -3tan7?( — ) -2tan7!( — 
5 18 57 
afl afl af 1 4 1 
=17tan™ | — } +3tan| —— } —2tan™ | —— } -7tan ‘i 
22 172 682 5,357 


For a long list of such type of formulas with a discussion of their relative merits in com- 


putational work, see Lehmer (1938). 
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1713. Chapter 15 of Shu li jing yun (Collected Basic Principles of Mathematics), which 
was commissioned by the Emperor Kang Xi and edited by Mei Gucheng and He Guozong, 
gives m = 3.14159265, which is correct to eight decimal places. 

1719. Thomas Fantet de Lagny (1660-1734) was a French mathematician who is well 
known for his contributions to computational mathematics. He used the series (17) to 
determine the value of z up to 127 decimal places; however, only 112 are correct. 

1728. Alexander Pope (1688-1744) was an English poet. He is the third-most frequently 
cited writer in The Oxford Dictionary of Quotations, after Shakespeare and Tennyson. In 
his Dunciad it is mentioned that ‘The mad Mathesis, now, running round the circle, finds 
it square’ This explains the wild and fruitless attempts of squaring the circle. 

1728. Sieur Malthulon (France) offered solutions to squaring the circle and to perpet- 
ual motion. He offered 1,000 crowns reward in legal form to anyone proving him wrong. 
Nicoli, who proved him wrong, collected the reward and abandoned it to the Hotel Dieu 
of Lyons. Later, the courts gave the money to the poor. 

1730. Toshikiyo Kamata (1678-1747) used both the circumscribed and inscribed 
polygons and gave the bounds 3.14159265358979323846265341667 < a < 
3.1415926535897932384.6264336658. 

1730. Abraham De Moivre (1667-1754) was an intimate personal friend of Newton, and 
was elected an FRS of London in 1697. In 1710, he was appointed to the Commission set 
up by the Royal Society to solve the Newton-Leibniz dispute concerning which of them 
invented calculus first. He is best known for his memoir Doctrine of Chances: A method of 
calculating the probabilities of events in play, which was first printed in 1618 and dedicated 
to Newton. In 1722, he published his famous theorem (cosx + isinx)” = cosnx + sinnx. In 
his Miscellanea Analytica published in 1730, appears the formula for very large n, 


V/2_-ny) 


n 
, 


n! x (27n) 


which is known today as Stirling’s formula. In 1733, De Moivre used this formula to derive 
the normal curve as an approximation to the binomial. 

1735. Leonhard Euler (1707-1783) was probably the most prolific mathematician who 
ever lived. He was born in Basel (Switzerland), and had the good fortune to be tutored one 
day a week in mathematics by a distinguished mathematician, Johann Bernoulli (1667- 
1748). Euler’s energy and capacity for work were virtually boundless. His collected works 
form about 60 to 80 quarto-sized volumes and it is believed that much of his work has 
been lost. What is particularly astonishing is that Euler became virtually sightless in his 
right eye during the mid-1730s, and was blind for the last 17 years of his life, and this was 
one of the most productive periods. In 1644, Mengoli asked for the precise summation 
of the infinite series }°°, n~*. The series is approximately equal to 1.644934.... In the 
literature, this problem has been referred after Basel, hometown of Euler as well as of the 
Bernoulli family who unsuccessfully attacked the problem. Basel problem appears in num- 
ber theory, e.g., if two positive integers are selected at random and independently of each 
other, then the probability that they are relatively prime is ()-"°, n~*)"1 (R. Chartres, 1904). 
An integer that is not divisible by the square of any prime number is said to be square free. 
2)-1 


The probability that a randomly selected integer is square free is also ()\~°, n~*)"1. Euler 


considered the function sinx/x, x 4 0 which has the roots at tz, n = 1,2,.... Thus, it 
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follows that 
sinx x? xt x6 
=1 + + 
x 3! 5! 7! 


x x x 
- (: a)(1- 5) (1- aa) : G0) 


Thus, on equating the coefficients of x2, we get 


1 1 1 1 
6 Px ” 2272 a 3272 v 


9 


which is the same as 


Pee (21) 


The above proof of Euler is based on manipulations that were not justified at the time, and 
it was not until 1741 that he was able to produce a truly rigorous proof. It is interesting to 
note that (20) with « = 1/2 immediately gives Wallis’ formula (8). Today, several different 
proofs of (21) are known in the literature. Euler also established the following series: 


ma \ in 

1 11 a 
ee ge ge 

1 1 1 1 x 
Pp pte pr Ap 


Later, Euler generalized the Basel problem considerably, in fact, for all positive integers 
k =1,2,..., he established the series 


1 1 1 (27)? (-1)"*1 Boy. 


Pies ier eae 
Bk" 92k * 32k 2(2n)! 


, 


where Bo, are Bernoulli numbers: By = 1/6,B, = —1/30,Be = 1/42,Bg = —1/30, By = 
5/66,.... In particular, he established 


1 1 1 _ 24 x 76,977,927 _ 96 
p26 * g26 * 326 27! aa 


Euler’s ideas were taken up years later by George Friedrich Bernhard Riemann (1826-1866) 
in his seminal 1859 paper, On the Number of Primes Less Than a Given Magnitude, in 
which he defined his zeta function 


¢(s)=l+—+—+—+---, s=otit, 
and proved its basic properties, and in a sovereign way simply stated a number of oth- 


ers without proof. After his death, many of the finest mathematicians of the world have 


exerted their strongest efforts and created new branches of analysis in attempts to prove 
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these statements. Since then with one exception, every statement has been settled in the 
sense Riemann expected. This exception is the famous Riemann hypothesis: that all the ze- 
ros of ¢(s) in the strip 0 < o <1 lie on the central line o = 1/2. It stands today as the most 
important unsolved problem of mathematics, and perhaps the most difficult problem that 
the mind of man has ever conceived. 

1737. The letter 7 was first used by Euler in 1737 in his Variae observationes circa se- 
ries infinitas. Until that time, he had been using the letters p (1734), or c (1736). In 1742, 
Christian Goldbach (1690-1764) also used zr. After the publication of Euler’s treatise: In- 
troductio in Analysin Infinitorum (1748), 1 became a standard symbol, as was the case 
with other notations he adopted. In 1737, Euler also showed that both e and e? are irra- 
tional and gave several continued fractions for e. In another paper, De variis modis circuli 
quadraturam numeris proxime exprimendi of 1737, Euler derived the formulas 


1 1 
tan(~) = tan'(—_) + tan ( 4) (22) 
P pt+q po +pqrl 
- b- -b 
tar(*) = tan (S *) + tan") +tan'( <<") tees 
y ay+x ab+1 ch+1 


and these give rise to any amount of relations for 7; for example, if x = 1 = y, and the odd 


and 


numbers are substituted for a, b,c,..., we obtain 


dd afi afi afl 
—=tan —})+tan — | +tan —])+-.- 
4 2 8 18 


The proof of (22) immediately follows by comparing the angles in the identity 
(p+ q+ i)(p’ +pq+1+ ig) = [p+ +1]@+i. 


1739. Matsunaga Yoshisuke (died in 1844) was a prolific writer. In modern terms, he 
used the hypergeometric series 


abx a(a+1)b(b+1)x? ala +1)(a + 2)b(b + 1)(b + 2)x° 


F(a,b,¢,x) =1 - 
ae eared Dic(e+1) 3ic(c + 1)(c + 2) ‘ 


for a = 1/2, b=1/2, c= 3/2, and x = 1/4, i.e., the series 


1131 1? 3? 3°. 5? 
nm =3F( -,-,-,-—]=3(1+ + — + tees 
2224 4-2-3 42.2-3-4-5  48.2-3-4-5-6-7 
to compute z correct to 50 digits. He also gave the following series: 


a 1? 1.2? 12.22. 3? 
mw =9(1+ + + teed. 
3-4 3-4-5-6 3-4-5-6-7-8 


1748. The following expansion of z is due to Euler 


1 1 111 21 #21 ~=21 ~21 
4+—- tts +ot+=—+ +—+—-— te, 
4 5 6 7 8 9 10 1 12 13 
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where the signs are determined following the rule: If the denominator is a prime of the 
form 4m — 1, the sign is positive; if the denominator is 2 or a prime of the form 4m + 1, 
the sign is negative; for composite numbers, the sign is equal to the product of signs of its 


factors. The following curious infinite product was also given by Euler: 


11 13 17 #19 23 29 31 
12 12 16 20 24 28 32 ” 


where the numerators are the odd primes and each denominator is the multiple of four 
nearest to the numerator. 

1750. Henry Sullamar, a real Bedlamite, found the quadrature of the circle in the number 
666 inscribed on the forehead of the beast in the Revelations. He published periodically 
every two or three years some pamphlet in which he endeavored to prop his discovery. 

1753. M. de Causans of the Guards cut a circular piece of turf, squared it and from the 
result deduced original sin and the Trinity. He found that the circle was equal to the square 
in which it is inscribed, i.e., 7 = 4. He offered a reward for the detection of any error, and 
actually deposited 10,000 francs as earnest of 300,000. But the courts did not allow any 
one to recover. 

1754. Jean Etienne Montucla (1725-1799) was an early French historian of mathematics. 
He published an anonymous treatise entitled Histoire des récherches sur la quadrature du 
cercle, and in 1758 the first part of his great work Histoire des mathématiques. 

1755. Euler in his treatise De relatione inter ternas pluresve quantitates instituenda, 
which was published ten years later, wrote ‘It appears to be fairly certain that the periphery 
ofa circle constitutes such a peculiar kind of transcendental quantities that it can in no way 
be compared with other quantities, either roots or other transcendentals: This conjecture 


haunted mathematicians for 107 years. The following expansion is due to Euler: 


1. If, 2 2-45 2-4-6, 
t =-(1+—y+—— oe |i 23 
an” x (a4 Zy4 oy et Me (23) 


where y = x?/(1 +x”). It converges rapidly. 

1760. Georges Louis Leclerc (Comte of Buffon 1707-1788) was a naturalist, mathemati- 
cian, cosmologist and encyclopedic author. Suppose a number of parallel lines, distance a 
apart, are ruled on a horizontal plane, and suppose a homogeneous uniform rod of length 
£ < ais dropped at random onto the plane. Buffon showed that the probability that the rod 
will fall across one of the lines in the plane is given by p = (2€/za). In the literature, this 
problem is known as Buffon’s needle problem. This was the earliest problem in geometric 
probability to be solved. By actually performing this experiment, a large number of times 
and noting the number of successful cases, we can compute an approximation for z. 

1761. Johann Heinrich Lambert (1728-1777) was the first to introduce hyperbolic func- 
tions into trigonometry. He wrote landmark books on geometry, the theory of cartog- 
raphy, and perspective in art. He is also credited for expressing Newton's second law of 
motion in the notation of the differential calculus. Lambert used the properties of con- 
tinued fractions to show that z is irrational. He published a more general result in 1768. 


Lambert also showed that the functions e* and tanx cannot assume rational values if x is 
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a non-zero rational number. He also gave an interesting continued fraction for 7, 


3 22 333 355 103,993 104,348 208,341 


, 


1 7 106’ 113 33,102 ” 33,215 ” 66,317 ’ 
312,689 833,719 1,146,408 4,272,943 
99,532’ 265,381’ 364,913 ” 1,360,120 ° 


1766. Arima Yoriyuki (1714-1783) was a Japanese mathematician of the Edo period. He 


found the following rational approximation of z, which is correct to 29 digits 


_ 428,224,593,349,304 
~ 136,308,121,570,117 ” 


1775. The French Academy of Sciences passed a resolution henceforth not to examine 
any more solutions of the problem of squaring the circle. In fact, it became necessary to 
protect its officials against the waste of time and energy involved in examining the efforts of 
circle squarers. A few years later, the Royal Society in London also banned consideration of 
any further proofs of squaring the circle. This decision of the Royal Society was described 
by Augustus De Morgan (1806-1871) about 100 years later as the official blow to circle- 
squarers. 

1776. Charles Hutton (1737-1823) was an English mathematician. He wrote several 
mathematical texts. In 1774, he was elected a fellow of the Royal Society of London. He 


suggested Machin’s stratagem in the form 


1 3 
x =20tan7| — ) +8tan?( — }; (24) 
7 79 


however, he did not carry computations far enough. Euler also developed the formula (24). 

1778. M. de Vausenville, one of the deluded individuals, brought an action against the 
French Academy of Sciences to recover a reward to which he felt himself entitled. It ought 
to be needless to say that there was no reward offered for squaring the circle. 

1779. Euler used his expansion (23) to evaluate right terms of (24), to calculate z to 20 
decimal places in one hour! 

About 1785. Franz Xaver Freiherr von Zach (1754-1832) discovered a manuscript by an 
unknown author in the Radcliffe Library, Oxford, which gives the correct value of z to 152 
decimal places. Zach was elected a member of the Royal Swedish Academy of Sciences in 


1794, a Fellow of the Royal Society in 1804, and an honorary member of the Hungarian 
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Academy of Sciences in 1832. Asteroid 999 Zachia and the crater Zach on the Moon are 
named after him. 

1789. Baron Jurij Bartolomej Vega (Georg Vega 1754-1802) was a Slovene mathemati- 
cian, physicist and artillery officer. He wrote six scientific papers. The record of de Lagny 
of 127 digits seems to have stood until 1789, when Vega, using a new series for the arctan- 
gent discovered by Euler in 1755, calculated 140 decimal places (126 correct). Vega’s result 
showed that de Lagny’s string of digits had a 7 instead of an 8 in the 113th decimal place. 
His article was not published until six years later, in 1795 (136 correct). Vega retained his 
record for 52 years until 1841. 

1794. Adrien-Marie Legendre (1752-1833) is remembered for Legendre functions, law 
of quadratic reciprocity for residues, standardizing weights and measures to the metric 
system, supervising the major task of producing logarithmic and trigonometric tables, 
least squares method of fitting a curve to the data available, proof of Fermat’s last theorem 
for the exponent 1 = 5, Gauss-Legendre algorithm, Legendre’s constant, Legendre’s equa- 
tion, Legendre polynomials, Legendre’s conjecture, and Legendre transformation. The 
Legendre crater on the Moon is named after him. Legendre, in his Elements de Géometrie 
(1794) used a slightly modified version of Lambert’s argument to prove the irrationality of 
= more rigorously, and also gave a proof that x” is irrational. He writes: ‘It is probable that 
the number z is not even contained among the algebraic irrationalities, i.e., that it cannot 
be the root of an algebraic equation with a finite number of terms, whose coefficients are 
rational. But, it seems to be very difficult to prove this strictly’ 

1795. Ajima Naonobu (1732-1798), also known as Ajima Chokuyen, was a Japanese 
mathematician of the Edo period. The series he developed can be simplified as 


a 3 1 ly 2! 3! 4! 
=F(1,1,-, =(1l+—+ + + tee 
2 2:2. 3 3:5 3-5-7 3-5-:7-9 


3 i! (it)? 2 
(21+)! Ss (i+ D! 


It is interesting to note that the above series follows from (5) by using an acceleration 
technique known in the literature as Euler’s transform. It can also be derived from the 
Wallis product formula (8). 

1795. Jean-Charles Callet (174-1799) in his tables gave 154 (152 correct) decimal digits 
of z. 

1797. Lorenzo Mascheroni (1750-1800) was educated with the aim of becoming a priest 
and he was ordained at the age of 17. In 1790, he calculated Euler’s constant to 32 (19 cor- 
rect) decimal places. Lorenzo dedicated his book, Geometria del compasso, to Napoleon 
Bonaparte. In this work, he proved that all Euclidean constructions can be made with 
compasses alone, so a straight edge in not needed. However, it was proved earlier in 1672 
by the Danish mathematician Georg Mohr (1640-1697). He claimed that compasses are 
more accurate then those of a ruler. 

1800. Karl Friedrich Gauss (1777-1855) was one of the greatest mathematicians of all 
time. Alexander von Humboldt (1769-1859), the famous traveler and amateur of the sci- 
ences, asked Pierre Simon de Laplace (1749-1827) who was the greatest mathematician in 
Germany, Laplace replied Johann Friedrich ‘Pfaff’ (1765-1825). ‘But what about Gauss’ the 


astonished von Humboldt asked, as he was backing Gauss for the position of director at 
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the Gottingen observatory. ‘Oh; said Laplace, ‘Gauss is the greatest mathematician in the 
world: Gauss suggested to his teacher Pfaff to study the sequences {x,,} and {y,,} generated 
by the recurrence relations 


1 
Xn = 3 en +IYn)s Yn =/SknviIn > 0. (25) 


In his reply, Pfaff showed that for any positive numbers x and yo these sequences converge 


monotonically to a common limit given by 


(93, — 02)? /cos*(xo/yo), O<x0 <Jo, 


y 26 
(x2 — y2)"?/ cosh (xo/¥o), 0 <J0 < Xo. (26) 


B(xo, Yo) = 


Pfaff’s letter was unpublished. In 1881, Carl Wilhelm Borchardt (1817-1880) work was pub- 
lished in which he rediscovered this result which now bears his name. For this, it suffices 
to note that: 

1. {x,} and {y,} converge monotonically to the same limit. 

2. The ratio ry = x;,/y, satisfies r2,, = (1 +r,)/2. 

3. Ifxo < yo, let @ = cos ro. Then, s, = 2” cos“! r, = 6 and cy = 4" (x2 — y?) = (xo — y2) 


are independent of n. 

2 len |? 
sin! (2 Jen |¥/2/yn) 
If yo < xo, we let 6 = cosh ro, and follow similarly. 

Now we let x, = 1/dy, Vn = 1/b,, then (25) and (26) take the form 


F 2_42y1/2 
lent? — %0-%0) 


4, Limp +00 Yn = LIM y+ 00 = Lim 00 7 


24,Dy 
Ans = a Dns HV Ann; n=0 (27) 
Ayn + by 
and 
1 2 _ f2)-1/2 egg-} : >0, 
A(ao, bo) _ _ Adobo (a5 0) cos (bo/ao) ao > bo >0 (28) 


B(1/ao,1/bo) aybo(b2 — a2)? cosh" (bo/ao), bo > do > 0. 


Clearly, the recurrence relations (2) are different from (27). In fact, (2) minimize the count 
of arithmetic operations. In particular, if we let ay = 2,/3, bo = 3, then (27) in view of (28) 
converges to 7. 

In what follows, we let the constant c = 4"(x2 — y?) = (x2 —y), then we can uncouple (25) 
and (27), respectively, to obtain 


Xn ey -n-1 ~ 2 yy, =n 1. -2\1/2 
*ni = 5 + 2 —4 c ? na = 5 (L+ (144 cy, ) ) (29) 
and 
g2ntl 1/2 g2ntl 1/2 
Ans1 = — (l-(1-4"caz)""), Bh = ((1+4-"cb*) ~ -1) (30) 
WL 


From (29) and (30), several known and new recurrence relations can be obtained. 
Gauss also developed the Machin-type formula 


x afl afl afl 
— =12tan™ | — } + 8tan~( — } —5tan~( —— }. (31) 
4 18 57 239 
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He also estimated the value of z by using lattice theory and considering a lattice inside a 
large circle, but he did not pursue it further. 
1810. Sakabe Kohan (1759-1824) developed the series 


a 1 1:4 (1-3)(4-6) (1-3+5)(4-6-8) 
4 5 5-7-9 5-7-9113) 57-911 13-15-17 : 


1818. Wada Yenzo Nei (known as Wada Yasushi, 1787-1840) developed over one hun- 
dred infinite series expressing directly or indirectly 7. One of his series can be written 
as 


113 P 12.3% 12.32.52 
nw =2F| —,-,—,1] =2(1+ + + qeece if, 
2’2’2 3! 5! 7! 


1825. Malacarne of Italy published a geometric construction in Géométrique (Paris), 
which leads to the value of z less than 3. 

1828. C.G. Specht of Berlin published a geometric construction in Crelle’s Journal, Vol- 
ume 3, p.83, which leads to 7 = 13./146/50 = 3.1415919.... 

1832. Karl Heinrich Schellbach (1809-1890) began with the relation 


Ti 


al In(i) = in( 4) =In(1+i)-InQ-d), 


1-i 


which is due to Giulio Carlo Fagnano dei Toschi (1682-1766), and used the logarithm ex- 
pansion (7), to obtain 


which immediately gives Nilakanthan series (5). He also considered the relation 


wi (2 + i)(3 + i) 
ge (5 2 Nena) 


spa) a) hs) | 


and used the expansion (7), to obtain, compare to (5), a fast converging expansion 


54 1 1 1/1 1 1/1 1 
= + + + + - 
4 2773 3\23 33 5\ 2° 38 


1833. William Baddeley in his work Mechanical quadrature of the circle, London Me- 


chanics’ Magazine, August, 1833 writes ‘From a piece of carefully rolled sheet brass was 
cut out a circle 1 9/10 inches in diameter, and a square 17/10 inches in diameter. On weigh- 
ing them, they were found to be exactly the same weight, which proves that, as each are of 
the same thickness, the surfaces must also be precisely similar. The rule, therefore, is that 
the square is to the circle as 17 to 19? We believe for the square it must be the side (not the 
diameter). Then it follows that z = 1,156/361 = 3.202216.... 
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1836. Joseph LaComme ‘at a time when he could neither read nor write being desirous 
to ascertain what quantity of stones would be required to prove a circular reservoir he 
had constructed, consulted a mathematics professor. He was told that it was impossible to 
determine the full amount, as no one had yet found the exact relation between the circum- 
ference of a circle and its diameter. The well-sinker thereupon, full of self-confidence in 
his powers, applied himself to the celebrated problem and discovered the solution, which 
lead to z = 25/8 by mechanical process. He then taught himself to read and write, and 
managed to acquire some knowledge of arithmetic by which he verified his mechanical 
solution’ Joseph was honored for his profound discovery with several medals of the first 
class, bestowed by Parisian societies. 

1841. William Rutherford (1798-1871) was an English mathematician. He calculated x 
to 208 places of which 152 were later found to be correct. For this, he employed Euler’s 
formula 


of afl afl afl 
— =4tan | — ) —tan“( — } +tan~ {| — 
4 5 70 99 


and Madhava’s series expansion (4). 

1844. Johann Martin Zacharias Dase (1824-1861) was a calculating prodigy. At the 
age of 15, he gave exhibitions in Germany, Austria and England. His extraordinary cal- 
culating powers were timed by renowned mathematicians including Gauss. He multi- 
plied 79,532,853 x 93,758,479 in 54 seconds; two 20-digit numbers in 6 minutes; two 
40-digit numbers in 40 minutes; and two 100-digit numbers in 8 hours 45 minutes. In 
1840, he made acquaintance with Viennese mathematician L.K. Schulz von Strasznicky 
(1803-1852) who suggested him to apply his powers to scientific purposes. When he was 
20, Strasznicky taught him the use of the formula 


T afl afl afl 
—=tan™{ — }+tan™ | — |} +tan™|{ — }, 
4 2 5 8 


and asked him to calculate z. In two months, he carried the approximation to 205 places 
of decimals, of which 200 are correct. He next calculated a 7-digit logarithm table of the 
first 1,005,000 numbers; he did this in his off-time from 1844 to 1847, when occupied by 
the Prussian survey. His next contribution of two years was the compilation of hyperbolic 
table in his spare time which was published by the Austrian Government in 1857. Next, he 
offered to make a table of integer factors of all numbers from 7,000,000 to 10,000,000; for 
this, on the recommendation of Gauss the Hamburg Academy of Sciences agreed to assist 
him financially, but Dase died shortly thereafter in Hamburg. He also had an uncanny 
sense of quantity. That is, he could just tell, without counting, how many sheep were ina 
field, or words in a sentence, and so forth, up to about 30. 

1844. Hiromu Hasegawa (1810-1887) and his father Hiroshi Hasegawa (1782-1838) pub- 
lished many Wasan books. Hiromu developed the series 


T 1 1 1 5 


=T a Sa, 
4 2-3 5-8 7-16 9-128 


This series can be written as 


4 135 
n= —F\| —,-,-,1}. 
3 G 2’2 ) 
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1847. Thomas Clausen (1801-1885) wrote over 150 papers on pure mathematics, applied 
mathematics, astronomy and geophysics. He used the formula 


1 4f1 afl 
— =2tan™~| — |} +tan | — 
4 2 iz 


to calculate 2 to 250 decimal places, but only 248 are correct. In 1854, he factored the 
sixth Fermat number as 2° + 1 = 67,280,421,310,721 x 274,177. Clausen also gave anew 
method of factorising numbers. 

1849. Jacob de Gelder (1765-1848) a mathematical ideologist proposed a geometric con- 
struction which gives z correct to 6 decimal places. His method is based on the fact that 
w = 355/113 = 3 + 4°/(7? + 87). Gelder’s result was published in 1849. 

1851. Joseph Liouville (1809-1882) was a highly respected professor at the Collége de 
France in Paris, and the founder and for 39 years the editor of the Journal des Mathé- 
matiques Pures et Appliquées. His ingenious theory of fractional differentiation answered 
the long-standing question of what reasonable meaning can be assigned to the symbol 
d"y/dx" when n is not a positive integer. In 1844, Liouville showed that e is not a root 
of any quadratic equation with integral coefficients. This led him to conjecture that e is 
transcendental. In 1851, Liouville showed, by using continued fractions, that there are an 
infinite number of transcendental numbers, a result which had previously been suspected 
but had not been proved. He produced the first examples of real numbers that are provably 
transcendent. One of these is 


“1 1 1 1 
Yo ape tat ete = 011000100... 
10" 10!" 102 106 


His methods led to extensive further research. 
1853. Lehmann correctly calculated 261 decimal places of z. For this, he used Euler’s 
formula 


a tar(5) + tar(5). (32) 
4 2 3 


1853. Rutherford obtained 440 correct decimal places. 

1853. William Shanks (1812-1882) was a British amateur mathematician. He used 
Machin formula (18) to calculate 7 to 607 decimal places. He was assisted by Rutherford 
in checking first 440 digits. 

1853-1855. Richter in 1853 published 333 digits (330 correct), and in 1855 (after his 
death in 1854) 500 decimal places. 

1860. James Smith published the value of z as 3 1/8 and argued that it is exact and 
correct. He attempted to bring it before the British Association for the Advancement of 
Science. Interestingly, even De Morgan and Hamilton could not convince him for his mis- 
take. 

1861. Philip H. Vanderweyde published an essay discussing the subject 2. He also used 
several constructions, resulting 7 = 3.1415926535.... 

1862. Lawrence Sluter Benson published about 20 pamphlets on the area of the cir- 
cle, three volumes on philosophic essays, and one on geometry The Elements of Eu- 
clid and Legendre. He demonstrated that the area of the circle is equal to 3R?, or the 
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arithmetical square between the inscribed and circumscribed squares. According to him 
/12 = 3.4641016... is the ratio between the diameter of a circle and the perimeter of its 
equivalent square. The ratio between the diameter and circumference, he believed, is not 
a function of the area of the circle. He accepted the value of 2 = 3.141592... 

1863. S.M. Drach proved that the circumference of a circle can be obtained as follows: 
From thrice diameter, deduct 8/1,000 and 7/1,000,000 of a diameter, and add 5% to the 
result, i.e., 


16 14 5 16 14 
27m =6- + 6 - F 
1,000 1,000,000 100 1,000 1,000,000 


which gives 7 = 3.14159265. 

1868. Cyrus Pitt Grosvenor (1792-1879) was an American anti-slavery Baptist minis- 
ter. In his retirement, he worked on the problem of squaring the circle. He described his 
method in a pamphlet titled The circle squared, New York: Square the diameter of the 
circle; multiply the square by 2; extract the square root of the product; from the root sub- 
tract the diameter of the circle; square the remainder; multiply this square by four fifths; 
subtract the square from the diameter of the circle, i.e., 


nD? 
4 


=D > (v2BE D) =D? : a 2 (/2- »| = D*(0.785533906...), 
which gives m = 3.142135.... 

1872. Augustus De Morgan (1806-1871) was born in Madura (India), but his family 
moved to England when he was seven months old. He lost the sight of his right eye shortly 
after birth. He was an extremely prolific writer. He wrote more than 1,000 articles for more 
than 15 periodicals. De Morgan also wrote textbooks on many subjects, including logic, 
probability, calculus and algebra. In 1866, he was a co-founder of the London Mathemat- 
ical Society and became its first President. His book A Budget of Paradoxes of 512 pages, 
which was edited and published by his wife in 1872, is an entertaining text. This book 
contains the names of 75 writers on z. In this work, De Morgan reviewed the works of 42 
of these writers, bringing the subject down to 1870. He once remarked that it is easier to 
square the circle then to get round a mathematician. He was the first to point out that in 
the decimal expansion of z one should expect each of the 10 digits appear uniformly, i.e., 
roughly one out of ten digits should be a 4, etc. 

1872. Asaph Hall (1829-1907) was an astronomer. He published the results of an ex- 
periment in random sampling that Hall had convinced his friend, Captain O.C. Fox, to 
perform when Fox was recovering from a wound received at the Second Battle of Bull 
Run. The experiment was based on Buffon’s needle problem. After throwing his needles 
eleven hundred times, Fox was able to derive 7 ~ 3.14. This work is considered as a very 
early documentation use of random sampling (which Nicholas Constantine Metropolis 
(1915-1999) named as the Monte Carlo method during the Manhattan Project of World 
War II). 

1873. Charles Hermite (1822-1901) in 1870 was appointed to a professorship at the Sor- 
bonne, where he trained a whole generation of well-known French mathematicians. He 
was strongly attracted to number theory and analysis, and his favorite subject was elliptic 
functions, where these two fields touch in many remarkable ways. His proof of the tran- 


scendence of e was high point in his career. 
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1873-1874. William Shanks again used Machin formula (18) to calculate 2 to 707 dec- 
imal places (published in the Proceedings of the Royal Society, London), but only 527 
decimal places are correct. For this, he used mechanical desk calculator and worked for 
almost 15 years. For a long time, this remained the most fabulous piece of calculation ever 
performed. In the Palais de la Découverte (a science museum in Paris), there is a circular 
room known as the ‘pi room: On its wall are inscribed these 707 digits of z. The digits are 
large wooden characters attached to the dome-like ceiling. Shanks also calculated e and 
the Euler-Mascheroni constant y to many decimal places. He published a table of primes 
up to 60,000 and found the natural logarithms of 2, 3, 5 and 10 to 137 places. 

1874. Tseng Chi-Hung (died in 1877) finds 100 digits of x in a month. For this, he used 
the formula (32). 

1874. John A. Parker in his book The Quadrature of the Circle. Containing Demonstra- 
tions of the Errors of Geometers in Finding Approximations in Use published by John Wiley 
& Sons, New York claims that 2 = 20,612/6,561 exactly. He exclaims, ‘all the serial and al- 
gebraic formula in the world, or even geometrical demonstration, if it be subjected to any 
error whatever, cannot overthrow the ratio of circumference to diameter which I have es- 
tablished’ He praises Metius (lived in the sixteenth century) for using the ratio 355/113. 
His book also contains practical questions on the quadrature applied to the astronomical 
circles. 

1876. Alick Carrick proposed in his book, The Secret of the Circle, its Area Ascertained, 
the value of z as 3 1/7. 

1879. Pliny Earle Chase (1820-1886) was a scientist, mathematician, and educator who 
mainly contributed to the fields of astronomy, electromagnetism and cryptography. In his 
pamphlet, Approximate Quadrature of the Circle, he used a geometric construction to 
obtain m7 = 3.14158499.... 

1882. Carl Louis Ferdinand von Lindemann (1852-1939) worked on non-Euclidean ge- 
ometry. He followed the method of Hermite to show that zr is also transcendental. His re- 
sult showed at last that the age-old problem of squaring the circle by a ruler-and-compass 
construction is impossible. Lindemann’s paper runs to 13 pages of tough mathematics. 
Karl Wilhelm Weierstrass (1815-1897), the apostle of mathematical rigor, simplified the 
proof of Lindemann’s theorem somewhat in 1885, and it was further simplified in later 
years by renowned mathematicians (Stieltjes, Hurwitz, Hilbert, and others). The inter- 
ested reader is referred to the comparatively easy version given by Hobson. Nonetheless, 
there are still some amateur mathematicians who do not understand the significance of 
this result, and futilely look for techniques to square the circle. Next, Lindemann spent sev- 
eral years to provide the proof of Fermat’s Last Theorem, which is unfortunately wrong. 
He also worked on projective geometry, Abelian functions and developed a method of 
solving equations of any degree using transcendental functions. 

1888. Sylvester Clark Gould (1840-1909) was the editor of Notes and Queries, Manch- 
ester, New Hampshire. He compiled the bibliography entitled What is the Value of Pi. It 
contains 100 titles and gives the result of 63 authors. In this work the diagram 16 claims 
that 7 = 3 3,949/27,889 exactly. 

1892. A writer announced in the New York Tribune the rediscovery of a long-lost secret 
that gives 3.2 as the exact value of 7. This announcement caused considerable discussion, 
and even near the beginning of the twentieth century 3.2 had its advocates as against the 
value 22/7. 
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1896. Fredrik Carl Miilertz Stormer (1874-1957) was a mathematician and physicist, 
known for his work in number theory. He gave the following Machin-like formulas for 


calculating x 


1 af afl afl afl 

— =44tan~( — | + 7tan~ ( —— } -12tan~ | —— } + 24tan (33) 
4 57 239 682 12,943 

1 1 1 1 

— =6tan?( — ) + 2tan?( — ) +tan7?( —— J. (34) 
4 8 57 239 


1897. In the State of Indiana, the House of Representatives unanimously passed the Bill 


and 


No. 246 (known as the ‘z bill’) introducing a new mathematical truth ‘Be it enacted by the 
General Assembly of the State of Indiana: It has been found that a circular area is to the 
square on a line equal to the quadrant of the circumference, as the area of an equilateral 
rectangle is to the square on one side...’ (7 = 3.2). The author of the bill was a physician, 
Edwin J. Goodman (1825-1902), M.D., of Solitude, Posey County, Indiana, and it was in- 
troduced in the Indiana House on 18 January 1897, by Mr. Taylor I. Record, representative 
from Posey County. Edwin offered this contribution as a free gift for the sole use of the 
State of Indiana (the others would evidently have to pay royalties). Copies of the bill are 
preserved in the Archives Division of the Indiana State Library. The bill was sent to the 
Senate for approval. Fortunately, during the House’s debate on the bill, Purdue University 
Mathematics Professor Clarence Abiathar Waldo (1852-1926) was present. When Profes- 
sor Waldo informed the Indiana Senate of the ‘merits’ of the bill, the Senate, after some 
ridicule at the expense of their colleagues, indefinitely postponed voting on the bill and let 
it die. 

1900. H.S. Uhler used Machin’s formula (18) to compute z to 282 decimal places. 

1901. Mario Lazzarini an Italian mathematician performed the Buffon’s needle experi- 
ment. Tossing a needle 3,408 times, he obtained the well-known estimate 355/113 for z. 
Although it is an impressive observation, but suspiciously good. In fact, statisticians Sir 
Maurice George Kendall (1907-1983) and Patrick Alfred Pierce Moran (1917-1988) FRS 
have commented that one can do better to cut out a large circle and use a tape to mea- 
sure to find its circumference and diameter. On the same theme of phoney experiments, 
Gridgeman, in 1960, pours scorn on Lazzerini and others, created some amusement by 
using a needle of carefully chosen length k = 0.7857, throwing it twice, and hitting a line 
once. His estimate for z was thus given by 2 x 0.7857/m = 1/2 from which he got the highly 
creditable value of z = 3.1428. Of course, he was not being serious. 

1902. Duarte used Machin’s formula (18) to compute 7 to 200 decimal places. 

1906. Various mnemonic devices have been given for remembering the decimal digits 
of 2. The most common type of mnemonic is the word-length mnemonic in which the 
number of letters in each word corresponds to a digit, for example, How I wish I could 
calculate pi (by C. Heckman), May I have a large container of coffee (by Martin Gardner), 
and How I want a drink, alcoholic of course, after the heavy lectures involving quantum 
mechanics (by Sir James Jeans), respectively, give 2 to seven, eight, and fifteen decimal 
places. Adam C. Orr in Literary Digest, vol. 32 (1906), p.84 published the following poem 
which gives z to 30 decimal places: 
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Now I, even I, would celebrate 

In rhymes inapt, the great 

Immortal Syracusan, rivaled nevermore 
Who in his wondrous lore, 

Passed on before 

Left men his guidance, 


How to circles mensurate. 


Several other such poems not only in English, but almost in every language including Al- 
banian, Bulgarian, Czech, Dutch, French, German, Italian, Latin, Polish, Portuguese, Ro- 
manian, Spanish and Swedish are known. However, there is a problem with this type of 
mnemonic, namely, how to represent the digit zero. Fortunately, a zero does not occur 
in z until the thirty-second place. Several people have come up with ingenious methods 
of overcoming this, most commonly using a ten-letter word to represent zero. In other 
cases, a certain piece of punctuation is used to indicate a naught. Michael Keith (with such 
similar understanding) in his work Circle digits: a self-referential story, Mathematical In- 
telligencer, vol. 8 (1986), 56-57, wrote an interesting story which gives first 402 decimals 
of z. 

1913. Ernest William Hobson (1856-1933) was Sadleirian Professor at the University 
of Cambridge from 1910 to 1931. His 1907 work on real analysis was very influential in 
England. In his book, Squaring the circle: A History of the Problem, he used a geometrical 
construction to obtain z = 3.14164079.... 

1914. Srinivasa Ramanujan (1887-1920) was a famous mathematical prodigy. He col- 
laborated with Hardy for five years, proving significant theorems about the number of 
partitions of integers. Ramanujan also made important contributions to number theory 
and also worked on continued fractions, infinite series and elliptic functions. In 1918, he 
became the youngest Fellow of the Royal Society. According to Hardy, ‘the limitations of 
Ramanujan’s knowledge were as startling as its profundity: Here was a man who could 
workout modular equations and theorems of complex multiplication, to orders unheard of, 
whose mastery of continued fractions was, beyond that of any mathematician in the world, 
who had found for himself the functional equation of the zeta-function, and the dominant 
terms of the many of the most famous problems in the analytic theory of numbers; and 
he had never heard of a doubly periodic function or of Cauchy's theorem, and had indeed 
but the vaguest idea of what a function of a complex variable was. Ramanujan considered 
mathematics and religion to be linked. He said, ‘an equation for me has no meaning unless 
it expresses a thought of God’ He was endowed with an astounding memory and remem- 
bered the idiosyncrasies of the first 10,000 integers to such an extent that each number 
became like a personal friend to him. Once Hardy went to see Ramanujan when he was in 
a nursing home and remarked that he had traveled in a taxi with a rather dull number, viz 
1,729, Ramanujan exclaimed, ‘No, Hardy, 1,729 is a very interesting number. It is the small- 
est number that can be expressed as the sum of two cubes viz 1,729 = 1° +123 = 9° + 10%, and 
the next such number is very large’. His life can be summed up in his own words, ‘I really 
love my subject: His 1914 paper on ‘Modulus functions and approximation to z’ contains 
several new innovative empirical formulas and geometrical constructions for approximat- 


ing 7. One of the remarkable formulas for its elegance and inherent mathematical depth 
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1 V8 (4m)! (1,103 + 26,390m) 
aa (35) 


= (m4 3964" 
It has been used to compute z to a level of accuracy, never attained earlier. Each additional 
term of the series adds roughly 8 digits. He also developed the series 


1 QA /2m\*42m+5 2 A (-1)"(4m + Y[(2m - YP 
—= > ( g12m+4 and = > (ann - 


The first series has the property that it can be used to compute the second block of k 
(binary) digits in the decimal expansion of z without calculating the first k digits. The 
following mysterious approximation which approximates z to 18 correct decimal places 
is also due to Ramanujan 

~ 2 


1 = 2 In((2V2 + VI0)(3-+ V0) 


1914. T.M.P. Hughes in his work A triangle that gives the area and circumference of 
any circle, and the diameter of a circle equal in area to any given square, Nature 93, 110, 
doi:10.1038/093110a0 uses a geometric construction to obtain 2 = 3.14159292035.... 

1928. In March 1928, the University of Minnesota was notified that Gottfried Lenzer (a 
native of Germany who lived in St. Paul for many years) had bequeathed to the university a 
series of 60 drawings from 1911-1927 and explanatory notes concerning the three classical 
problems of antiquity. He used a geometrical construction for squaring the circle to obtain 
mw =3.1378.... 

1934. Alexander Osipovich Gelfond (1906-1968) was a Soviet mathematician. He 
proved that e” (Gelfond’s constant) is transcendental, but nothing yet is known about 
the nature of any of the numbers z + e, we or 7°. 

1934. Helen Abbot Merrill (1864-1949) earned her Ph.D. from Yale in 1903 on the thesis 
On Solutions of Differential Equations which Possess an Oscillation Theorem. She served as 
an associate editor of The American Mathematical Monthly during 1916-1919, and was a 
vice-president from 1920 to 1921 of the Mathematical Association of America. Her book 
Mathematical Excursions: Side Trips Along Paths not Generally Traveled in Elementary 
Courses in Mathematics, Bruce Humphries, Inc., Boston, 1934 was a text for the general 
public. In this book, a geometric construction is given (perhaps by an earlier author) which 
leads to m = 3.141591953.... 

1934. Edmund Georg Hermann (Yehezkel) Landau (1877-1938) was a child prodigy. In 
1903, he gave a simpler proof of the prime number theorem. His masterpiece of 1909 was a 
treatise Handbuch der Lehre von der Verteilung der Primzahlen a two volume work giving 
the first systematic presentation of analytic number theory. Landau wrote over 250 papers 
on number theory, which had a major influence on the development of the subject. De- 
spite his outstanding talents as both a teacher and researcher, Landau annoyed many of his 
colleagues at Gottingen. He started criticizing privately, and often publicly, their results. 
Landau in his work defined 7/2 as the value of x between 1 and 2 for which cos x vanishes. 


One cannot believe this definition was used, at least as an excuse, for a racial attack on 
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Landau. This unleashed an academic dispute which was to end in Landau’s dismissal from 
his chair at Gottingen. Ludwig Georg Elias Moses Bieberbach (1886-1982) famous for his 
conjecture, explained the reasons for Landau’s dismissal: “Thus the valiant rejection by the 
Gottingen student body which a great mathematician, Edmund Landau, has experienced 
is due in the final analysis to the fact that the un-German style of this man in his research 
and teaching is unbearable to German feelings. A people who have perceived how mem- 
bers of another race are working to impose ideas foreign to its own must refuse teachers of 
an alien culture: Hardy replied immediately to Bieberbach about the consequences of this 
un-German definition of zr: ‘There are many of us, many Englishmen and many Germans, 
who said things during the War which we scarcely meant and are sorry to remember now. 
Anxiety for one’s own position, dread of falling behind the rising torrent of folly, deter- 
mination at all cost not to be outdone, may be natural if not particularly heroic excuses. 
Professor Bieberbach’s reputation excludes such explanations of his utterances, and I find 
myself driven to the more uncharitable conclusion that he really believes them true’ 

1934. A Cleveland businessman Carl Theodore Heisel published a book Mathemati- 
cal and Geometrical Demonstrations in which he announced the grand discovery that 
was exactly equal to 256/81, a value that the Egyptians had used some 4,000 years ago. 
Substituting this value for calculations of areas and circumferences of circles with diam- 
eters 1,2,... up to 9, he obtained numbers which showed consistency of circumference 
and area, ‘thereby furnishing incontrovertible proof of the exact truth’ of his ratio. He also 
rejected decimal fractions as inexact (whereas ratios of integers as exact and scientific), 
and extracted roots of negative numbers thus: ./—a = ./a-, Ja = -a. He published this 
book on his own expense and distributed to colleges and public libraries throughout the 
United States without charge. 

1934. Miff Butler claimed discovery of a new relationship between z and e. He stated 
his work to be the first basic mathematical principle ever developed in USA. He convinced 
his congressman to read it into the Congressional Record on 5 June 1940. 

1940. H.S. Uhler used Machin’s formula (18) to compute z to 333 decimal places. 

1945-1947. D.F. Ferguson of England used the formula 


T afl afl 4 1 
— =3tan | — )+tan~( — } +tan~ | —— 
4 4 20 1,985 


to find that his value disagreed with that of William Shanks in the 528th place. In 1946, he 
approximated z to 620 decimal places, and in January 1947 to 710 decimal places. In the 
same month William Shanks used Machin’s formula (18) to compute 808-place value of 
x, but Ferguson soon found an error in the 723rd place. For all the calculations, he used 
desk calculator. 

1947. Ivan Morton Niven (1915-1999) gave an elementary proof that zr is irrational. 

1949. Ferguson and John William Wrench, Jr. (1911-2009) using a desk calculator, com- 
puted 1,120 decimal digits of 2. This record was broken only by the electronic computers. 

September 1949. John Wrench and L.R. Smith (also attributed to George Reitwiesner 
et al.) were the first to use an electronic computer Electronic Numerical Integrator and 
Computer (ENIAC) at the Army Ballistic Research Laboratories in Aberdeen, Maryland, 
to calculate 2 to 2,037 decimal places. For this, they programed Machin’s formula (18). 
It took 70 hours, a pitifully long time by today’s standards. In this project, John Louis 
von Neumann (1903-1957), one of the most versatile and smartest mathematicians of the 
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twentieth century, also took part. In 1965, The ENIAC became obsolete, and it was dis- 
membered and moved to the Smithsonian Institution as a museum piece. 
1951. Konrad Knopp gave the following two expansions of z: 


FS 1 m2 (-1)* 1 1 
~=> tan d => 1 tee ‘ 
4 - (acen) one ig awl a +s) 


1953. Kurt Mahler (1903-1988) showed that z is not a Liouville number: A real number 
x is called a Liouville number if for every positive integer n, there exist integers p and q 
with q > 1 and such that 


A Liouville number can thus be approximated ‘quite closely’ by a sequence of rational 
numbers. In 1844, Liouville showed that all Liouville numbers are transcendental. 

1954. S.C. Nicholson and J. Jeenel programmed NORC (Naval Ordnance Research Cal- 
culator) at Dahlgren, Virginia to compute z to 3,092 decimals. For this, they used Machin’s 
formula (18). The run took only 13 minutes. 

1956. John Gurland established that for all positive integers n, 


4n+3 ( (2n)! \? 4 (2n)!! \? 
——~ | ———__] «z< ——_]. (36) 
(2n +1)? \ (2n—-1)! 4n+1\ (2n-1)! 
March 1957. G.E. Felton used the Ferranti Pegasus computer to find 10,021 decimal 
places of z in 33 hours. The program was based on Klingenstierna’s formula 


1 1 1 
x = 32arctan 4 arctan 16 arctan{| —— }. (37) 
10 239 515 


However, a subsequent check revealed that a machine error had occurred, so that ‘only’ 


7,480 decimal places were correct. The run was therefore repeated in May 1958, but the 
correction was not published. 

January 1958. Francois Genuys programmed an IBM 704 at the Paris Data Processing 
Center. He used Machin’s type formula (18). It yielded 10,000 decimal places of z in 1 hour 
and 40 minutes. 

July 1959. Genuys programmed an IBM 704 at the Commissariat 4 l’Energie Atomique 
in Paris to compute z to 16,167 decimal places. He used Machin’s type formula (18). It 
took 4 hours and 30 minutes. 

July 1961. Daniel Shanks (1917-1996) and William Shanks used Stérmer’s formula (34) 
on an IBM 7090 (at the IBM Data Processing Center, New York) to compute z to 100,265 
digits, of which the first 100,000 digits were published by photographically reproducing 
the print-out with 5,000 digits per page. The time required for this computation was 8 
hours and 43 minutes. They also checked the calculations by using Gauss’ formula (31), 
which required 4 hours and 22 minutes. 

1961. Machin’s formula (18) was also the basis of a program run on an IBM 7090 at the 
London Data Center in July 1961, which resulted in 20,000 decimal places and required 
only 39 minutes running time. 
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February 1966. Jean Guilloud and J. Filliatre used an IBM 7030 at the Commissariat 4 
l’Energie Atomique in Paris to obtain an approximation of 7 extending to 250,000 decimal 
places on a STRETCH computer. For this, they used Stérmer’s and Gauss’ formulas (34) 
and (31). It took almost 28 hours. 

February 1967. Guilloud and M. Dichampt used CDC (Control Data Corporation) 
6600 in Paris to approximate 7 to 500,000 decimal places. For this, they used Stérmer’s 
and Gauss’ formulas (34) and (31). The computer that churned out half a million digits 
needed only 26 hours and 40 minutes (plus 1 hour and 30 minutes to convert that final 
result from binary to decimal notation). 

1968. In the Putnam Competition, the first problem was 


22 1 y4(1—x)4 
X= [= *) dx. 
0 


1+2x2 


This integral was known to Mahler in the mid-1960s, and has later appeared in several 
exams. It is also discussed by Borwein, Bailey, and Girgensohn in their book on p.3. 

1971. K.Y. Choong, D.E. Daykin and C.R. Rathbone used 100,000 digits of Daniel Shanks 
and William Shanks (1961) to generate the first 21,230 partial quotients of the continued 
fraction expansion of z. 

1974. Ralph William Gosper, Jr. (born 1943), known as Bill Gosper, is a mathematician 
and programmer. He is best known for the symbolic computation, continued fraction rep- 
resentations of real numbers, Gosper’s algorithm, and Gosper curve. He used a refinement 
of Euler transform on (5) to obtain the series 


1 I. 23 1. 23 3-5 
60 *607-8-3 *607-8-310-11-3 
1 2:3 3-5 4:7 
*607-8-310-11-313-14-3 


mw=3+ 


Stee, 


1974. Guilloud with Martine Bouyer (Paris) used formulas (34) and (31) ona CDC 7600 
to compute z to 1,000,250 digits. The run time required for this computation was 23 
hours and 18 minutes, of which 1 hour 7 minutes was used to convert the final result from 
binary to decimal. Results of statistical tests, which generally support the conjecture that 
x is simply normal (in 1909, Félix Edouard Justin Emil Borel (1871-1956) defined: A real 
number a is simply normal in base d if in its representation in base b all digits occur, in an 
asymptotic sense, equally often) were also performed. 

1974. Louis Comtet developed the following Euler’s type expansion of z: 


1976. Richard Brent and Eugene Salamin independently discovered an algorithm which 
is based on an arithmetic-geometric mean and modifies slightly Gauss-Legendre algo- 
rithm. Set ao = 1, bo = 1/2 and so = 1/2. For k = 1,2,3,... compute 


a1 t+ dea 
a a 


bp = Vad, 
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ce =a — b, (38) 
Sk = Spa — 2c, 


2 
PR= a, 

Then px converges quadratically to z, i.e., each iteration doubles the number of accurate 
digits. In fact, successive iterations must produce 1, 4, 9, 20, 42, 85, 173, 347 and 697 
correct digits of 7. The twenty-fifth iteration must produce 45 million correct decimal 
digits of z. 

1981. Kazunori Miyoshi and Kazuhika Nakayama of the University of Tsukuba, Japan 
calculated z to 2,000,038 significant figures in 137.30 hours on a FACOM M-200 com- 
puter. They used Klingenstierna’s formula (37) and checked their result with Machin’s for- 
mula (18). 

1981. Guilloud computed 2,000,050 decimal digits of z. 

1981. Rajan Srinivasan Mahadevan (born 1957) recited from memory the first 31,811 
digits of x. This secured him a place in the 1984 Guinness Book of World Records, and he 
has been featured on Larry King Live and Reader’s Digest. 

1982. Kikuo Takano (1927-2006) was a Japanese poet and mathematician. He developed 
the following Machin-like formula for calculating z: 


1 afl afl af 1 = 1 
— =12tan™ (| — }) +32tan~{ — } —5tan| —— } +12tan . (39) 
4 49 57 239 110,443 


1982. Yoshiaki Tamura on MELCOM 900II computed 2,097,144 decimal places of z. 
For this, he used the Salamin-Brent algorithm (38). 

1982. Yoshiaki Tamura and Yasumasa Kanada (born 1948, life-long ‘pi digit-hunter; set 
the record 11 of the past 21 times) on HITAC M-280H computed 4,194,288 decimal places 
of z. For this, they used the Salamin-Brent algorithm (38). 

1982. Yoshiaki Tamura and Yasumasa Kanada on HITAC M-280H computed 8,388,576 
decimal places of z. For this, they used the Salamin-Brent algorithm (38). 


October 1983. Yasumasa Kanada, Yoshiaki Tamura, Sayaka Yoshino and Yasunori 
Ushiro on HITAC S-810/20 computed 10,013,395 decimal places of z. For this, they used 
the Salamin-Brent algorithm (38). In this work to gather evidence that z is simply nor- 
mal, they also performed statistical analysis. It showed expected behavior. In the first 
ten million digits, the frequencies for each ten digits are 999,440; 999,333; 1,000,306; 
999,964; 1,001,093; 1,000,466; 999,337; 1,000,207; 999,814; and 1,000,040. Further, the 
rate at which the relative frequencies approach 1/10 agrees with theory. As an exam- 
ple, for the digit 7 relative frequencies in the first 10, i = 0,1,2,3,4,5,6,7 digits are 
0, 0.08, 0.095, 0.097, 0.10025, 0.0998, 0.1000207, which seem to be approaching 1/10 at 
rate predicted by the probability theory for random digits, i.e., a speed approximately pro- 
portional to 1/,/7. But this is far from a formal proof of simple normalcy perhaps for a 
proof the current mathematics is not sufficiently developed. In spite of the fact that the 
digits of z pass statistical tests for randomness, 7 contains some sequences of digits that, 
to some, may appear non-random, such as Feynman point, which is a sequence of six con- 
secutive 9s that begins at the 762nd decimal place. A number is said to be norma if all 
blocks of digits of the same length occur with equal frequency. Mathematicians expect 2 
to be normal, so that every pattern possible eventually will occur in the digits of z. 
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1983. Yasumasa Kanada, Sayaka Yoshino and Yoshiaki Tamura on HITAC M-280H 
computed 16,777,206 decimal places of z. For this, they used the Salamin-Brent algo- 
rithm (38). 

1984. Jonathan Borwein and Peter Borwein gave the following algorithm. Set x» = /2, 
yo =O and ap = 2+ /2. Iterate 


Hk = (Sx + 1//%x)/2, 
1 
Sex Vii( =m i; (40) 


Vk + Xk 


(; + “1 
Ak+1 = AkVk+1 . 
14+ Ve 


Then a, converges to m quartically. The algorithm is not self-correcting; each iteration 
must be performed with the desired number of correct digits of z. 
1984. Morris Newman and Daniel Shanks proved the following: Set 


1,071 3 / 
a= os + 92/734 + 3 255,349 + 43,792.34, 


_ 1,533 


b 


+ 133/34 + sv 4,817,509 + 826,196 /34, 
c= 429 + 30472 + 2 92,218 + 65,208V2, 
d= = + 22172 + av 783,853 + 554,268V2, 

then 


1 <7.4x 107°, 


In(2abcd) 


3,502 


1985. Gosper used Symbolics 3,670 and Ramanujan’s formula (35) to compute z to 
17,526,200 decimal digits. 

1985. Jonathan Borwein and Peter Borwein gave the following algorithm. Set dp = 6 — 
4/2 and Yo = /2 -1. Iterate 


-a-yf)"4 
14+ (1—yA)¥4’ (41) 


Any = a(1 + Yeu)* — 23 yi (1 + Via + Yeu)» 


Vk+i = 


Then ax converges quartically to 1/7, i.e., each iteration approximately quadruples the 
number of correct digits. 
1985. The following is not an identity, but is correct to over 42 billion digits 


83 2 
(e x se) me; 


n=-00 


1985. Carl Sagan in his novel deals with the theme of contact between humanity and a 
more technologically advanced, extraterrestrial life form. He suggests that the creator of 
the universe buried a message deep within the digits of 7. 
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January 1986. David H. Bailey used Borweins’ algorithms (40) and (41) on CRAY-2 to 
compute 29,360,111 decimal places of z. 

September 1986. Yasumasa Kanada and Yoshiaki Tamura on HITAC S-810/20 com- 
putes 33,554,414 decimal places of z. For this, they used algorithms (38) and (41). 

October 1986. Yasumasa Kanada and Yoshiaki Tamura on HITAC S-810/20 computed 
67,108,839 decimal places of z. For this, they used algorithm (38). 

January 1987. Yasumasa Kanada, Yoshiaki Tamura, Yoshinobu Kubo and others on 
NEC SX-2 computed 134,214,700 decimal places of z. For this they used algorithms (38) 
and (41). 

1987. Jonathan Borwein and Peter Borwein gave the following algorithm. Set xo = 2, 
yy = 2/4 and po = 2 + 2". Iterate 


he. ay 
_ 12, gl? 
Xk = 5 ea +) ), 
i. , 0/9 
‘p= Vk-1% py FH 
M-atd 
xe +1 
Pk = Pk-1 : 
Ye+1 


Then p; decreases monotonically to z and |p, — | < 10-2" fork > 4. 

1987. Hideaki Tomoyori (born 1932) recited x from memory to 40,000 places taking 
17 hours 21 minutes, including breaks totaling 4 hours 15 minutes, at Tsukuba University 
Club House. 

January 1988. Yasumasa Kanada on HITAC S-820/80 computed 201,326,551 decimal 
places of z. For this, he used algorithms (38) and (41). 

1988. Jonathan Borwein and Peter Borwein developed the series 


1 (-1)"(6n)! (A + Bn) 
> 12>) (n!)3(3n)! Cmte? 


where 


A = 212,175,710,912V/61 + 1,657,145,277,365, 
B = 13,773,980,892,672V 61 + 107,578,229,802,750, 
C = [5,280(236,674 + 30,303V61)]. 


Each additional term of the series adds roughly 31 digits. 
1988. Dario Castellanos gave the following approximation: 


_ (72 


1/5 
= 3.1415926541.... 
254 


May 1989. David Volfovich Chudnovsky (born 1947) and Gregory Volfovich Chud- 
novsky (born 1952) have published hundreds of research papers and books on number 
theory and mathematical physics. Gregory solved Hilbert’s tenth problem at the age of 17. 
They on CRAY-2 and IBM 3090/VF computed 480,000,000 decimal places of z. 
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June 1989. David and Gregory Chudnovsky on IBM 3090 computed 535,339,270 dec- 
imal places of z. 

July 1989. Yasumasa Kanada and Yoshiaki Tamura on HITAC S-820/80 computed 
536,870,898 decimal places of z. For this, they used algorithm (38). 

August 1989. David and Gregory Chudnovsky developed the following rapidly conver- 
gent generalized hypergeometric series: 


foe) 
, (6n)! 13,591,409 + 545,140,134n 
= 12-0" (42) 


n)3(3n)! (640,3203)"+1/2 


Each additional term of the series adds roughly 15 digits. This series is an improved version 
to that of Ramanujan’s (35). It was used by the Chudnovsky brothers to calculate more than 
one billion (to be exact 1,011,196,691) digits on IBM 3090. 

November 1989. Yasumasa Kanada and Yoshiaki Tamura on HITAC S-820/80 com- 
puted 1,073,74.0,799 decimal places of z. For this, they used algorithms (38) and (41). 

August 1991. David and Gregory Chudnovsky used a home made parallel computer 
(they called it m zero, where m stands for machine, and zero for the success) to obtain 
2,260,000,000 decimal places of z. For this they used series (42). 

1991. David Boll discovered an occurrence of z in the Mandelbrot set fractal. 

1991. Jonathan Borwein and Peter Borwein improved on the Salamin-Brent algorithm 
(38). Set ap = 1/3 and sp = (3 — 1)/2. Iterate 


3 


ka = —— 
1+ 20—83)18 


kit —1 
2 


Sk+1 = , 


_ 2 k(,2 
kt = Vek — 3 (an = 1). 


Then 1/a, converges cubically to 7, i.e., each iteration approximately triples the number 
of correct digits. 
Among the several other known iterative schemes, we list the following two which are 


easy to implement on a computer: Set do = 1/2 and sp = 5(./5 — 2). Iterate 


Xn+1 = 5/8 -L 


Yn = (nt _ 1)? +7, 


1 1/5 
Zn+1 = (Jon (Yuu + V Yast = a) , 


5 8-5 
Ans1 = S,Ay — 5" “a + 4/5n(s? — 2s, +5) }, 


25 


Zn+1 + XnsilZns1 + 1P°sy 


Sn41 = ( 


Then ax converges quintically to 1/7, i.e., each iteration approximately quintuples the 


number of correct digits, and 0 <a, —1/m <16-5"- eo" rn, 
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Set do = 1/3, ro = (V3 — 1)/2 and so = (1— 2)". Iterate 


to =1+ 2ry, 


Uns = (9 n(1 + Pn + Py, 


2 2 
Vos = bia + Ens Uns + Una 


27(1 + Sy +S?) 
Wri a a nc 
n+1 


2n-1 
On = Wry1dn + 3 - (1 — Wrst); 
5 (1 Tn) 
ntl = 7? 7 
(tsi + Uns) Vine 


1/3 
Tal = (1 a Sia) 
Then a converges nonically to 1/7, i.e., each iteration approximately multiplies the num- 
ber of correct digits by nine. 

1993. Jonathan Borwein and Peter Borwein developed the series 


=a (6m)! A+mB 
x < (3m) !(m!)3_ C3m 


n= 


where 


A = 63,365,028,312,971,999,585,426,220 
+ 28,337,702,14.0,800,842,046,825,600V5 
+ 384-+/5(10,891,728,551,171,178,200,467,436,212,395,209,160,385,656,017 
+ 4,870,929,086,578,810,225,077,338,534,541,688,721,351,255,040/5) _ , 
B=7,849,910,453,496,627,210,289,749,000 
+ 3,510,586,678,260,932,028,965,606,400V/5 
+ 2,515,968x/3,110(6,260,208,323,789,001,636,993,322,654,444,020,882,161 


1/2 
+ 2,799,650,273,060,4.44,296,577,206,890,718,825,190,235/5) 


and 


C = -214,772,995,063,512,240 — 96,049,4.03,338,648,032V5 


- 1,296/5(10,985,234,579,463,550,323,713,318,473 


1/2 
+ 4,912,74.6,253,692,362,754,607,395,912/5) : 


Each additional term of the series adds approximately 50 digits. However, computation of 
this series on a computer does not seem to be easy. 

May 1994. David and Gregory Chudnovsky used a home made parallel computer m 
zero to obtain 4,044,000,000 decimal places of z. For this they used series (42). 
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June 1995. Yasumasa Kanada and Daisuke Takahashi on HITAC S-3800/480 (dual 
CPU) computed 3,221,225,466 decimal places of z. For this, they used algorithms (38) 
and (41). 

August 1995. Yasumasa Kanada and Daisuke Takahashi on HITAC S-3800/480 (dual 
CPU) computed 4,294,967,286 decimal places of z. For this, they used algorithms (38) 
and (41). 

October 1995. Yasumasa Kanada and Daisuke Takahashi on HITAC S-3800/480 (dual 
CPU) computed 6,442,450,938 decimal places of z. For this, they used algorithms (38) 
and (41). 

1995. David Bailey, Peter Borwein and Simon Plouffe developed the following formula 


(known as BBP formula) to compute the mth hexadecimal digit (base 16) of 2 without 
having the previous n — 1 digits 


lee} 
1 4 2 1 1 
T - : 43 
Sees 8m+4 8m+5 ans) ee 


m=0 


To show the validity of (43), for any k < 8, we have 


UV2 yk 1/2 © : 1 2 T 
dx = x —1+8m dx = oe. ——r 
[ 1-x8 [ > Qk/2 > 16” (8m +k) 


therefore 
> 1 4 2 1 1 
5 16! 8m+1 8m+4 8m+5 8m+6 
WVv2 3 4 5 
An/2 — 8x? — 4./2x* — 8x 
7 / v2 ws ax, (44) 
5 1-28 


Substituting u = ./2x in equation (44), we obtain 


[ 16u - 16 du= [du BH 5g 

9 U4-2u2 +4u-4 0 u2-2 0 uw2-2u+2 

The discovery of this formula came as a surprise. For centuries, it had been assumed that 
there was no way to compute the nth digit of 7 without calculating all of the preceding 
n —1 digits. Since this discovery, many such formulas for other irrational numbers have 
been discovered. Such formulas have been called as spigot algorithms because, like water 
dripping from a spigot, they produce digits that are not reused after they are calculated. 

1996. Simon Plouffe discovered an algorithm for the computation of z in any base. Later 
he expressed regrets for having shared credit for his discovery of this formula with Bailey 
and Borwein. 

March 1996. David and Gregory Chudnovsky used a home made parallel computer m 
zero to obtain 8,000,000,000 decimal places of z. For this, they used series (42). They said 
‘we are looking for the appearance of some rules that will distinguish the digits of x from 
other numbers, i.e., if someone were to give you a million digits from somewhere in 7, 
could you tell it was from 7? The digits of 7 form the most nearly perfect random sequence 
of digits that has ever been discovered. However, each digit appears to be orderly. Ifa single 
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digit in 7 were to be changed anywhere between here and infinity, the resulting number 
would no longer be z,, it would be garbage. Around the three-hundred-millionth decimal 
place of zr, the digits go 88888888-eight eights pop up in a row. Does this mean anything? 
It appears to be random noise. Later, ten sixes erupt: 6666666666. What does this mean? 
Apparently nothing, only more noise. Somewhere past the half-million mark appears the 
string 123456789. It is an accident, as it were. We do not have a good, clear, crystallized 
idea of randomness. It cannot be that z is truly random. Actually, a truly random sequence 
of numbers has not yet been discovered’ 


1996. Gosper posted the following fascinating formula 


2n 
TT 
lim | |] ——— = 4!" =1.554682.... 
eat 2tan-1m 


April 1997. Yasumasa Kanada and Daisuke Takahashi on HITACHI SR2201 (1,024 
CPU) computed 17,179,869,142 decimal places of z. For this, they used algorithms (38) 
and (41). 

July 1997. Yasumasa Kanada and Daisuke Takahashi on HITACHI SR2201 (1,024 CPU) 
computed 51,539,600,000 decimal places of 2. The computation tool just over 29 hours, 
at an average rate of nearly 500,000 digits per second. For this, they used algorithms (38) 
and (41). 

1997. Fabrice Bellard developed the following formula: 


1 (-1)”" 
ee qlom 
m=0 
2° 1 2° 2° 2? 2? 1 
- + = - + 
4m+1 4m+3 10m+1 10m+3 10m+5 10m+7 10m+9 


which can used to compute the nth digit of 7 in base 2. It is about 43% faster then (43). 


The following exotic formula is also due to him: 


foe) 


_ 1 3P(m) 
* * 740,025 Ss (5) Qm—1 20279280] 


m=1 \2m 


where 


P(m) = -885,673,181m° + 3,125,347,237m' — 2,942,969,225m" 


+ 1,031,962,795m” — 196,882,274m + 10,996,648. 


April 1999. Yasumasa Kanada and Daisuke Takahashi on HITACHI SR8000 (64 of 128 
nodes) computed 68,719,470,000 decimal places of z. For this, they used algorithms (38) 
and (41). 

September 1999. Yasumasa Kanada and Daisuke Takahashi on HITACHI SR8000/MPP 
(128 nodes) computed 206,158,430,000, i.e., 206 billion decimal places of z. For this, they 
used algorithms (38) and (41). 
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1999. Leo Jerome Lange developed the following continued fraction of z: 


72 
Gass: 


6+ 


2000. J. Munkhammar gave the following formula which is related to Viéte’s (6): 


x = lim 2”*1 
n->oo 


A Dee PR rere 
2- - ; 


which as a recurrence relation can be written as 7 = limy-+9 2”*!a,, where dy = /2, and 


1 2 1 242 
ay = yt +/1-,/1- yt . 


Another closely related formula is 


sata SIC) 


m=1 


2001. Robert Palais believes that the notation z is wrongly used right from the begin- 
ning. According to him, some suitable symbol (now popular as tau T) must have been used 
for 27. He justifies his claim by giving several formulas where t appears naturally rather 
than just 2. For some people, June 28, is Tau’s Day and they celebrate. 

November 2002. Yasumasa Kanada used Machin-like formulas (33) and (39) to com- 
pute the value of z to 1,241,100,000,000 decimal places. The calculation took more than 
600 hours on 64 nodes of a HITACHI SR8000/MPP supercomputer. The work was done 
at the Department of Information Science at the University of Tokyo. For this, he used 
arctangent formulas (33) and (39). 

2004. Daniel Tammet, at age 25, recited 22,514 decimal places of 7, scoring the Euro- 
pean record. For an audience at the Museum of the History of Science in Oxford, he said 
these numbers aloud for 5 hours and 9 minutes. Unfortunately, he made his first mistake 
at position 2,965 and did not correct this error immediately and without outside help, but 
only after he was told that there was a mistake. 

2005. Stephen K. Lucas found that 


355 [ x8 (1 — x)8(25 + 816x7) 
T= 
13 Jo 3,164(1 + x2) 


Several other integral formulas of this type are known, here we give the following: 


y 


741,269,838,109 i xl6(1 — x6 
T= 
235,953,517,800 Jo 64(1+.x?) 
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which gives 3.14159265358955 < z. If we substitute « = 1 in the above integral and note 
that 


—x!6(1 — x)® dx = ee 
o 128 2,538,963,567,360 
then it follows that mz < 3.14159265358996. 

November 2005. Chao Lu, a chemistry student, at age 23, broke the Guiness record by 
reciting 7 from memory to 67,890 places. For this, he practiced for 4 years. The attempt 
lasted 24 hrs 4 min and was recorded on 26 video tapes. The attempt was witnessed by 8 
officials, math professors and 20 students. 

2005. Kate Bush in the song z (in her album Aerial) sings the number to its 137th dec- 
imal place (though she omits the 79th to 100th decimal places). 

October 2006. Akira Haraguchi a retired engineer from Chiba recited 2 from memory 
to 54,000 digits in September 2004, 68,000 digits in December 2004, 83,431 digits in July 
2005, and 100,000 digits in October 2006. He accomplished the last recitation in 16 1/2- 
hours in Tokyo. He says memorization of the digits of z is ‘the religion of the universe’ 

2006. Simon Plouffe found the following curious formula: 


= 1 
am 2) rea aw = —96 > k(e 2k _ rie k(etk= -1) 


2008. In Midnight (tenth episode of the fourth series of British science fiction tele- 
vision series Doctor Who), the character, the businesswoman, Sky Silvestry mimics 
the speech of The Doctor by repeating the square root of 2 to 30 decimal places 
1.77245385090551602729816748334. 

2008. Syamal K. Sen and Ravi P. Agarwal suggested four Matlab based procedures, 
viz, (i) Exhaustive search, (ii) Principal convergents of continued fraction based proce- 
dure, (iii) Best rounding procedure for decimal (rational) approximation, and (iv) Contin- 
ued fraction based algorithm with intermediate convergents. While the first procedure is 
exponential-time, the remaining three are polynomial-time. Roughly speaking, they have 
demonstrated that the absolute best k-digit rational approximation of z will be as good as 
2k-digit decimal approximation of z. The absolute best k-digit rational approximation is 
most desired for error-free computation involving  /any other irrational number. 

2008. Syamal K. Sen, Ravi P. Agarwal and Ghoolam A. Shaykhian have demonstrated 
through numerical experiment using Matlab that z has always scored over @ (golden ra- 
tio), as a source of uniformly distributed random numbers, statistically in one-dimensional 
Monte Carlo (M.C.) integration; whether z fares better than ¢ for double, triple and higher 
dimensional M.C. integration or not deserves exploration. 

2009. Syamal K. Sen, Ravi P. Agarwal and Ghoolam A. Shaykhian compared the four 
procedures they proposed in (2008) for computing best k-digit rational approximations 
of irrational numbers in terms of quality (error) and cost (complexity). They have stressed 
on the fact that ultra-high-speed computing along with abundance of unused computing 
power allows employing an exponential-time algorithm for most real-world problems. 
This obviates the need for acquiring and employing the mathematical knowledge involving 
principal and intermediate convergents computed using a polynomial-time algorithm for 
practical problems. Since z is the most used irrational number in the physical world, the 
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simple concise Matlab program would do the job wherever z /any other irrational number 
is involved. 

2009. Syamal K. Sen, Ravi P. Agarwal and Raffela Pavani have provided, using Matlab, 
the best possible rational bounds bracketing z/any irrational number with absolute er- 
ror and the time complexity involved. Any better bounds are impossible. In these rational 
bounds, either the lower bound or the upper bound will always be the absolute best ratio- 
nal approximation. The absolute error computed provides the overall error bounds in an 
error-free computational environment involving z /any other irrational number. 

2009. Tue N. Vu has given Machin-type formula (http://seriesmathstudy.com/sms/ 
machintypetv): For each positive integer n, 


1 af 22 afl z z 1 5 2 
—=t t t ——, t ——_, é 
Aen (=) sacs (— y> un 2(2 +k)? oe (3 + 2k) 


2009. Cetin Hakimoglu-Brown developed the following expansion: 


oo. aa 127,169 ~=—1,070 131 2 
= = = + F 
~ 65 aa aM 12k+1 9 12k+5 12k+7 12k+11 


which can be written as 


, 


vB 3 (4k)\(671,840K? + 1,289,936K2 + 782,458k + 150,835) 
~ 4156S (72)#**1 (13/12), (17/12), (19/12), (23/12), 


where (x), = «(x + 1)(x + 2)---(« + kK — 1) is the Pochhammer notation. He also gave the 
expansion 


5,717 413 45 D 
-mab @) (a 8k+3 8k+5 a): 

August 2009. Daisuke Takahashi et al. used a massive parallel computer called the T2K 
Tsukuba System to compute 7 to 2,576,980,377,524 decimal places in 73 hours 36 min- 
utes. For this, they used algorithms (38) and (41). 

December 2009. Fabrice Bellard used Chudnovsky brothers series (42) to compute 
2,699,999,990,000, i.e., 2.7 trillion decimal places of z in 131 days. For this, he used a 
single desktop PC, costing less than $3,000. 

August 2010. Shigeru Kondo and Alexander J. Yee used Chudnovsky brothers series 
(42) to compute 5,000,000,000,000, ie., 5 trillion decimal places of x in 90 days. For 
this, they used a server-class machine running dual Intel Xeons, equipped with 96 GB of 
RAM. 

2010. Michael Keith used 10,000 digits of 2 to establish a new form of constrained 
writing, where the word lengths are required to represent the digits of 2. His book contains 
a collection of poetry, short stories, a play, a movie script, crossword puzzles and other 
surprises. 

2011. Syamal K. Sen and Ravi P. Agarwal in their monograph systematically organized 
their work of 2008 and 2009 on z and other irrational numbers. They also included sev- 
eral examples to illustrate the importance of their findings. 
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2011. During the auction for Nortel’s portfolio of valuable technology patents, Google 
made a series of strange bids based on mathematical and scientific constants, including 7. 

October 2011. Shigeru Kondo and Alexander J. Yee used Chudnovsky brothers series 
(42) to compute 10,000,000,000,050, i.e., 10 trillion decimal places of 7 in 371 days. 

2011. Cristinel Mortici improved Gurland’s bounds (36) to a, < 7 < By, > 1 where 


n+i 9 45 (2n) \? 
a= + - 
""Xnr+ins 5° 2,048n5 — 8,192n6 }\ (2n- 1)! 


and 


n+ t 9 (2n)!! 2 
Bn = T - 3 t 5 ; 
m+snt= 2,048n (2n —1)! 


It follows that 


Oy, = + (=) and B,=m+ (=). 
n n 


2012. Long Lin has improved Mortici’s bounds to Ay, < 1 < dy, n> 1 where 


1 3 3 3 33 39 2 (Qn) \* 
An= {1+ - + + - 
4n 32n2 128n? 2,048n* 8,192n> 65,536n° ) 2n + 1\ (2n- DI! 


and 


tat 3 3 3 2 (2n) \? 
a + + + ¢ 
He 4n 32n? 128n3 2,048n4*/ 2n+1\(2n-1)! 


It follows that 


1 1 
n= +0(<) and n=1+0( =). 
n n 


He has also obtained the higher order bounds 6, < a < @,, 1 > 1 where 


: 1 1 1 1 17 31 (Qn)! \? 
= —ex + + - 
"n P 4n 96n3 320n> = 7,168n7 = 9,216? J} \ (2n —1)!! 


and 


1 1 1 1 17 (2n) \* 
4 = —eXpy— + - + 7 
Be Pan 96n3 32005 7,168n7 (2n 1)! 


It follows that 


1 1 
s,=2 +0( =) and n= +0( 5). 
n n 
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Conclusions 

No number system can capture m exactly. We are deeply and almost completely involved 
in the conventional decimal number system in representing any real quantity. This is not 
the only number system for the representation. There are other number systems such 
as binary, octal, hexadecimal, binary-coded decimal, negative radix, p-adic and modular 
number systems. If the circumference of a circle is exactly represented, then its diameter 
will not have exact representation and vice versa. 

Reading the mathematicians in pre-computer days. An important focus of this paper is 
that the reader besides, however, knowing the usual chronology of the events in the life of 
z, could get a feel and also read how the mind of a mathematician has been working when 
he ponders over z either independently without much knowledge/concern of what has 
been done in the past or with considerable knowledge of the work done by his predeces- 
sors. Hyper-computers (108 flops) of 2012 were completely non-existence and even be- 
yond the imagination of all the mathematicians/scientists until almost the mid-twentieth 
century. Also, publication machinery was too poor until the beginning of the twentieth 
century. Consequently, all the work on z that has been carried out during thousands of 
years prior to the twentieth century was not a monotonic improvement in the z value 
as well as in the exploration of its wonderful character. Many have worked on z stand- 
alone while others have contributed with some prior knowledge of the earlier work. All 
of them were severely handicapped due to the non-existence of today’s ultra-high speed 
computers. They entirely depended on their ingenuity and on whatever negligible com- 
puting device they had. It is really interesting under this environment to read these scien- 
tists/mathematicians and realize how fortunate we are in the gigantic computer age. All 
that has been done during the last 20 years (1990-2010) amounts to much more than what 
has been achieved during the past several millennia. 

Matlab is well-suited to check/evaluate merits of all past x formulas. Widely used user- 
friendly Matlab that needs no formal programming knowledge along with the vpa (variable 
precision arithmetic) and format long g commands can be used to easily and readily check 
all that has been done during the past several thousand years and possibly appreciate the 
inherent intellectual import of the bygone scientists (having practically no computing de- 
vice) and their expected pitfalls, bias and incorrect beliefs. 

Checking exactness of billions of digits of m is difficult. Are all the billions of digit of x 
computed 100% error-free? We are familiar with the age-old proverb that “To err is hu- 
man (living being): Maybe a new proverb ‘Not to err is computer (non-living being)’ can 
be taken as true in the modern computer age. Here, ‘err’ means mistake. The arithmetic 
operations, particularly subtraction operations of two nearly equal numbers, involved in 
a formula could be sometimes error introducer. However, different computers with dif- 
ferent formulas used to compute z would help verification and obviate possible error in 
computation. 

Computing nth decimal digit exactly always without preceding digits seems yet an open 
computational problem. While probabilistically one may determine the nth digit of x with- 
out computing the preceding n —1 digits, obtaining nth digit exactly (correctly) always for 
any 1 does not seem to be possible without a large precision. It seems yet an open com- 
putational problem that needs exploration. Thus, formulas such as (43) seem more of 


theoretical/academic interest than of practical usage as of now. 
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PI for testing performance and stability of a computer. Super PI is a computer program 
that calculates m to a specified number of digits after decimal point up to a maximum 
of 32 million digits. It uses the Gauss-Legendre algorithm and is a Windows port of the 
program used by Yasumasa Kanada in 1995 to compute z to 2* digits. Super PI is used by 
many overclockers to test the performance and stability of their computers. Overclocking 
is the process of making a computer run faster than the clock frequency specified by the 
manufacturer by modifying system parameters. 


Competing interests 
The authors declare that they have no competing interests. 


Authors’ contributions 
All authors contributed equally and read and approved the final manuscript. 


Author details 

'Department of Mathematics, Texas A&M University-Kingsville, Kingsville, TX, 78363, USA. 71540 Ravena Street, 
Bethlehem, PA 18015, USA. ?GVP-Prof. V. Lakshmikantham Institute for Advanced Studies, #1-83-21/3, Sector 8, MVP 
Colony, Visakhapatnam, A-P, India. 


Acknowledgements 
Dedicated to V Lakshmikantham (1924-2012). 


Received: 22 January 2013 Accepted: 25 March 2013 Published: 11 April 2013 


References 
1. Adamchik, V, Wagon, S: A simple formula for 7. Am. Math. Mon. 104, 852-855 (1997) 

2. Adamchik, V, Wagon, S: Pi: a 2000-year search changes direction. Educ. Res. 5, 11-19 (1996) 

3. Ahmad, A: On the zr of Aryabhata |. Ganita Bharati 3, 83-85 (1981) 

4. Akira, H: History of zr. Kyoiku Tosho, Osaka (1980) 

5. Al-Kashi, J: Treatise on the Circumference of the Circle (1424) 

6. Almkvist, G: Many correct digits of zr, revisited. Am. Math. Mon. 104, 351-353 (1997) 

7. Anderson, DV: A polynomial for 7. Math. Gaz. 55, 67-68 (1971) 

8. Anonymous: Cyclometry and Circle-Squaring in a Nutshell. Simpkin, Marshall & Co., Stationer’s Hall Court, London 
(1871) 

9. Arndt, J: Cryptic Pi related formulas. http://www jj.de/hfloat/pise.dvi 

10. Arndt, J, Haenel, C: 2-Unleashed. Springer, Berlin (2000) 

11. Assmus, EF: Pi. Am. Math. Mon. 92, 213-214 (1985) 

12. Backhouse, N: Note 79.36. Pancake functions and approximations to 7. Math. Gaz. 79, 371-374 (1995) 

13. Badger, L: Lazzarini’s lucky approximation of 7. Math. Mag. 67, 83-91 (1994) 

14. Bai, S: An exploration of Liu Xin’s value of zr from Wang Mang's measuring vessel. Sugaku-shi Kenkyu 116, 24-31 
(1988) 

15. Bailey, DH: Numerical results on the transcendence of constants involving 77, e, and Euler's constant. Math. Comput. 
50, 275-281 (1988) 

16. Bailey, DH: The computation of to 29,360,000 decimal digits using Borweins’ quartically convergent algorithm. 
Math. Comput. 50, 283-296 (1988) 

17. Bailey, DH, Borwein, JM, Borwein, PB, Plouffe, S: The quest for pi. Math. Intell. 19, 50-57 (1997) 

18. Bailey, DH, Borwein, PB, Plouffe, S: On the rapid computation of various polylogarithmic constants. Math. Comput. 
66, 903-913 (1997) 

19. Beck, G, Trott, M: Calculating Pi from antiquity to modern times. http://library.wolfram.com/infocenter/Demos/107/ 

20. Beckmann, P: A History of zr. St Martin's, New York (1971) 

21. Bellard, F: Fabrice Bellard’s, Pi page. http://bellard.org/pi/ 

22. Berggren, L, Borwein, JM, Borwein, PB: Pi: A Source Book, 3rd edn. Springer, New York (2004) 

23. Beukers, F: A rational approximation to mr. Nieuw Arch. Wiskd. 5, 372-379 (2000) 

24. Blatner, D: The Joy of IT. Penguin, Toronto (1997) 

25. Bokhari, N: Piece of Pi. Dandy Lion, San Luis Obispo (2001) 

26. Boll, D: Pi and the Mandelbrot set. http://www.pi314.net/eng/mandelbrot.php 

27. Borwein, JM, Bailey, DH, Girgensohn, R: Experimentation in Mathematics: Computational Paths to Discovery. 
AK Peters, Wellesley (2004) 

28. Borwein, JM, Borwein, PB: A very rapidly convergent product expansion for zr. BIT Numer. Math. 23, 538-540 (1983) 

29. Borwein, JM, Borwein, PB: Cubic and higher order algorithms for mr. Can. Math. Bull. 27, 436-443 (1984) 

30. Borwein, JM, Borwein, PB: Explicit algebraic nth order approximations to 7. In: Singh, SP, Burry, JHW, Watson, B (eds.) 
Approximation Theory and Spline Functions, pp. 247-256. Reidel, Dordrecht (1984) 

31. Borwein, JM, Borwein, PB: The arithmetic-geometric mean and fast computation of elementary functions. SIAM Rev. 
26, 351-365 (1984) 

32. Borwein, JM, Borwein, PB: An explicit cubic iteration for z. BIT Numer. Math. 26, 123-126 (1986) 

33. Borwein, JM, Borwein, PB: More quadratically converging algorithms for a. Math. Comput. 46, 247-253 (1986) 

34. Borwein, JM, Borwein, PB: Pi and the AGM - A Study in Analytic Number Theory and Computational Complexity. 
Wiley-Interscience, New York (1987) 


420 22. BIRTH, GROWTH AND COMPUTATION OF PI TO TEN TRILLION DIGITS (2013) 


Agarwal et al. Advances in Difference Equations 2013, 2013:100 
http://www.advancesindifferenceequations.com/content/2013/1/100 


35. 


36. 
37. 
38. 


39. 


40. 


41. 


42. 


43. 


44, 
45, 


46. 
47. 


48. 


49, 
50. 
51. 
52. 
53. 
54. 
55s 
56. 


BY A 


58. 
59. 


60. 
61. 


62. 
63. 
64. 
65. 
66. 
67. 
68. 
69. 
70. 
Li, 
72. 
73. 
74. 
75. 
76. 
77. 
78. 
79. 
80. 
81. 
82. 
83. 
84. 
85. 
86. 
87. 
88. 
89. 
90. 
91. 
92. 


Borwein, JM, Borwein, 
Sci. 97, 53-59 (1987) 
Borwein, JM, Borwein, 
Borwein, JM, Borwein, 
Borwein, JM, Borwein, 
Press, Boston (1988) 
Borwein, JM, Borwein, 
(1989) 

Borwein, JM, Borwein, 
(1993) 

Borwein, JM, Borwein, 
one billion digits of . 
Borwein, JM, Borwein, 


B: Explicit Ramanujan-type approximations to z of high order. Proc. Indian Acad. Sci. Math. 
B: Ramanujan’s rational and algebraic series for 1/z. J. Indian Math. Soc. 51, 147-160 (1987) 
B: Ramanujan and zr. Sci. Am. 258, 112-117 (1988) 

B: More Ramanujan-type series for 1/zr. In: Ramanujan Revisited, pp. 359-374. Academic 

B: Approximating a with Ramanujan’s modular equations. Rocky Mt. J. Math. 19, 93-102 

B: Class number three Ramanujan type series for 1/zr. J. Comput. Appl. Math. 46, 281-290 

B, Bailey, DH: Ramanujan, modular equations, and approximations to 7, or how to compute 


Am. Math. Mon. 96, 201-219 (1989) 
B, Dilcher, K: Pi, Euler numbers, and asymptotic expansions. Am. Math. Mon. 96, 681-687 


(1989) 
Borwein, JM, Borwein, PB, Garvan, F: Hypergeometric analogues of the arithmetic-geometric mean iteration. Constr. 
Approx. 9, 509-523 (1993) 

Borwein, PM: The amazing number II. Nieuw Arch. Wiskd. 1, 254-258 (2000) 

Brent, RP: The complexity of multiple-precision arithmetic. In: Andressen, RS, Brent, RP (eds.) Complexity of 
Computational Problem Solving. University of Queensland Press, Brisbane (1976) 

Brent, RP: Fast multiple-precision evaluation of elementary functions. J. ACM 23, 242-251 (1976) 

Breuer, S, Zwas, G: Mathematical-educational aspects of the computation of zr. Int. J. Math. Educ. Sci. Technol. 15, 
231-244 (1984) 
Brown, CH: An algorithm for the derivation of rapidly converging infinite series for universal mathematical 
constants. Preprint (2009) 
Bruins, EM: With roots towards Aryabhata’s zr-value. Ganita Bharati 5, 1-7 (1983) 

Carlson, BC: Algorithms involving arithmetic and geometric means. Am. Math. Mon. 78, 496-505 (1971) 

Castellanos, D: The ubiquitous pi, part |. Math. Mag. 61, 67-98 (1988) 

Castellanos, D: The ubiquitous pi, part Il. Math. Mag. 61, 148-163 (1988) 

Chan, J: As easy as Pi. Math Horizons, Winter 1993, 18-19 

Choong, KY, Daykin, DE, Rathbone, CR: Rational approximations to 2. Math. Comput. 25, 387-392 (1971) 

Choong, KY, Daykin, DE, Rathbone, CR: Regular continued fractions for x and y. Math. Comput. 25, 403 (1971) 
Chudnovsky, DV, Chudnovsky, GV: Approximations and complex multiplication according to Ramanujan. In: 
Ramanujan Revisited, pp. 375-396 & 468-472. Academic Press, Boston, (1988) 

Chudnovsky, DV, Chudnovsky, GV: The computation of classical constants. Proc. Natl. Acad. Sci. USA 86, 8178-8182 
(1989) 

Cohen, GL, Shannon, AG: John Ward's method for the calculation of zr. Hist. Math. 8, 133-144 (1981) 

Colzani, L: La quadratura del cerchio e dell'iperbole (The squaring of the circle and hyperbola). Universita degli studi 
di Milano-Bicocca, Matematica, Milano, Italy 

Cox, DA: The arithmetic-geometric mean of Gauss. Enseign. Math. 30, 275-330 (1984) 

Dahse, Z: Der Kreis-Umfang fir den Durchmesser 1 auf 200 Decimalstellen berechnet. J. Reine Angew. Math. 27, 198 
(1944) 
Dalzell, DP: On 22/7. J. Lond. Math. Soc. 19, 133-134 (1944) 

Dalzell, DP: On 22/7 and 355/113. Eureka Archimed. J. 34, 10-13 (1971) 

Datta, B: Hindu values of zr. J. Asiat. Soc. Bengal 22, 25-42 (1926) 

Davis, PJ: The Lore of Large Numbers. New Mathematical Library, vol. 6. Math. Assoc. of America, Washington (1961) 
Delahaye, JP: Le Fascinant Nombre z. Bibliothéque Pour la Science, Belin (1997) 

Dixon, R: The story of pi (zr). In: Mathographics. Dover, New York (1991) 

Engels, H: Quadrature of the circle in ancient Egypt. Hist. Math. 4, 137-140 (1977) 

Eymard, P, Lafon, JP: The Number Pi. Am. Math. Soc., Providence (1999) (Translated by S.S. Wilson) 

Ferguson, DF: Evaluation of mr. Are Shanks’ figures correct? Math. Gaz. 30, 89-90 (1946) 

Ferguson, DF: Value of zr. Nature 17, 342 (1946) 

Finch, SR: Mathematical Constants. Cambridge University Press, Cambridge (2003) 

Frisby, E: On the calculation of pi. Messenger Math. 2, 114 (1872) 

Fox, L, Hayes, L: A further helping of zr. Math. Gaz. 59, 38-40 (1975) 

Fuller, R: Circle and Square. Springfield Printing and Binding Co., Springfield (1908) 

Genuys, F: Dix milles décimales de z. Chiffres 1, 17-22 (1958) 

Goggins, JR: Formula for 7/4. Math. Gaz. 57, 134 (1973) 

Goldsmith, C: Calculation of In 2 and zr. Math. Gaz. 55, 434-436 (1971) 

Goodrich, LC: Measurements of the circle in ancient China. Isis 39, 64-65 (1948) 

Gosper, RW: Acceleration of series. Memo no. 304., M.I-T,, Artificial Intelligence Laboratory, Cambridge, Mass. (1974) 
Gosper, RW: math-fun@cs.arizona.edu posting, Sept. (1996) 

Gosper, RW: A product, math-fun@cs.arizona.edu posting, Sept. 27 (1996) 

Gould, SC: What is the value of Pi. Notes and Queries, Manchester, N.H. (1888) 

Gourdon, X, Sebah, P: Collection of series for 7. http://numbers.computation free.fr/Constants/Pi/piSeries.html 
Greenblatt, MH: The ‘legal’ value of 7 and some related mathematical anomalies. Am. Sci. 53, 427A-432A (1965) 
Gregory, RT, Krishnamurthy, EV: Methods and Applications of Error-Free Computation. Springer, New York (1984) 
Gridgeman, NT: Geometric probability and the number zr. Scr. Math. 25, 183-195 (1960) 

Guilloud, J, Bouyer, M: Un Million de Décimales de 2. Commissariat 4 |Energie Atomique, Paris (1974) 

Gupta, RC: Aryabhata I's value of zr. Math. Educ. 7, 17-20 (1973) 

Gupta, RC: Madhava's and other medieval Indian values of zr. Math. Educ. 9, 45-48 (1975) 

Gupta, RC: Some ancient values of pi and their use in India. Math. Educ. 9, 1-5 (1975) 

Gupta, RC: Lindemann's discovery of the transcendence of zr: a centenary tribute. Ganita Bharati 4, 102-108 (1982) 


Page 56 of 59 


22. BIRTH, GROWTH AND COMPUTATION OF PI TO TEN TRILLION DIGITS (2013) 421 


Agarwal et al. Advances in Difference Equations 2013, 2013:100 
http://www.advancesindifferenceequations.com/content/2013/1/100 


93. 
94. 
95. 
96. 
97. 
98. 


99. 
100. 


101. 
102. 
103. 
104. 
105. 
106. 
107. 
108. 
109, 


110. 
111, 
112, 


N13; 


114, 


IS; 
116. 
117. 


118. 
119. 
120. 


121. 


122. 


123. 
124. 
125. 


126. 
127. 
128. 


129; 
130. 
131. 
132. 
133. 
134. 
135. 
136. 
IBZ 
138. 
139, 
140. 
141, 


142. 
143. 


144, 
145. 
146, 


147, 
148. 


Gupta, RC: New Indian values of z from the ‘Manava’sulba sutra. Centaurus 31, 114-125 (1988) 
Gupta, RC: On the values of z from the bible. Ganita Bharati 10, 51-58 (1988) 
Gupta, RC: The value of zr in the ‘Mahabharata’ Ganita Bharati 12, 45-47 (1990) 
Gurland, J: On Wallis’ formula. Am. Math. Mon. 63, 643-645 (1956) 
Hall, A: On an experimental determination of pi. Messenger Math. 2, 113-114 (1873) 
Hata, M: Improvement in the irrationality measures of mr and zr’. Proc. Jpn. Acad., Ser. A, Math. Sci. 68, 283-286 
(1992) 
Hata, M: Rational approximations to 2 and some other numbers. Acta Arith. 63, 335-349 (1993) 
Hayashi, T: The value of zr used by the Japanese mathematicians of the 17th and 18th centuries. In: Bibliotheca 
Mathematics, vol. 3, pp. 273-275 (1902) 
Hayashi, T, Kusuba, T, Yano, M: Indian values for z derived from Aryabhata's value. Hist. Sci. 37, 1-16 (1989) 
Hermann, E: Quadrature of the circle in ancient Egypt. Hist. Math. 4, 137-140 (1977) 
Hobson, EW: Squaring the Circle: A History of the Problem. Cambridge University Press, Cambridge (1913) 
Huygens, C: De circuli magnitudine inventa. Christiani Hugenii Opera Varia, vol. |, pp. 384-388. Leiden (1724) 
Huylebrouck, D: Van Ceulen’s tombstone. Math. Intell. 4, 60-61 (1995) 
Hwang, CL: More Machin-type identities. Math. Gaz. 81, 120-121 (1997) 
Jami, C: Une histoire chinoise du nombre zr. Arch. Hist. Exact Sci. 38, 39-50 (1988) 
Jha, P: Aryabhata | and the value of w. Math. Educ. 16, 54-59 (1982) 
Jha, SK, Jha, M: A study of the value of z known to ancient Hindu and Jaina mathematicians. J. Bihar Math. Soc. 13, 
38-44 (1990) 
Jones, W: Synopsis palmiorum matheseos, London, 263 (1706) 
Jorg, A, Haenel, C: Pi Unleashed, 2nd edn. Springer, Berlin (2000) (Translated by C. Lischka and D. Lischka) 

anada, Y: Vectorization of multiple-precision arithmetic program and 201,326,000 decimal digits of a calculation. 
n: Supercomputing: Science and Applications, vol. 2, pp. 117-128 (1988) 

anada, Y, Tamura, Y, Yoshino, S, Ushiro, Y: Calculation of to 10,013,395 decimal places based on the 
Gauss-Legendre algorithm and Gauss arctangent relation. Technical report 84-01, Computer Center, University of 
Tokyo (1983) 
eith, M: Not a Wake: A Dream Embodying (pi)'s Digits Fully for 10,000 Decimals. Vinculum Press, Baton Rouge 
(2010) (Diana Keith (Illustrator)) 
nopp, K: Theory and Application of Infinite Series. Blackie, London (1951) 
ochansky, AA: Observationes Cyclometricae ad facilitandam Praxin accomodatae. Acta Erud. 4, 394-398 (1685) 
rishnamurhty, EV: Complementary two-way algorithms for negative radix conversions. IEEE Trans. Comput. 20, 
543-550 (1971) 

ulkarni, RP: The value of z¢ known to Sulbasutrakaras. Indian J. Hist. Sci. 13, 32-41 (1978) 
Laczkovich, M: On Lambert's proof of the irrationality of 2. Am. Math. Mon. 104, 439-443 (1997) 
de Lagny, F: Mémoire sur la quadrature du cercle et sur la mesure de tout arc, tout secteur et tout segment donné. 
n: Histoire de LAcadémie Royale des Sciences. Académie des sciences, Paris (1719) 
Lakshmikantham, V, Leela, S, Vasundhara Devi, J: The Origin and History of Mathematics. Cambridge Scientific 
ublishers, Cambridge (2005) 
Lambert, JH: Mémoire sur quelques propriétés remarquables des quantités transcendantes circulaires et 
logarithmiques. In: Mémoires de l'Académie des Sciences de Berlin, vol. 17, pp. 265-322 (1761) 
Lange, LU: An elegant continued fraction for a. Am. Math. Mon. 106, 456-458 (1999) 
Lay-Yong, L, Tian-Se, A: Circle measurements in ancient China. Hist. Math. 13, 325-340 (1986) 
Lazzarini, M: Un’ applicazione del calcolo della probabilita alla ricerca sperimentale di un valore approssimato di zr. 
eriod. Mat. 4, 140-143 (1901) 
Legendre, AM: Eléments de Géométrie. Didot, Paris (1794) 
Lehmer, DH: On arctangent relations for 2. Am. Math. Mon. 45, 657-664 (1938) 
Lin, L: Further refinements of Gurland’s formula for zr. J. Inequal. Appl. 2013, 48 (2013). 
doi:10.1186/1029-242X-201 3-48 
Lindemann, F: Uber die Zahl zr. Math. Ann. 20, 213-225 (1882) 
Le Lionnais, F: Les Nombres Remarquables. Hermann, Paris (1983) 
Lucas, SK: Integral proofs that 355/113 > . Aust. Math. Soc. Gaz. 32, 263-266 (2005) 
Mao, Y: A short history of zr in China. Kexue 3, 411-423 (1917) 
Maor, E: The history of on the pocket calculator. J. Coll. Sci. Teach. Nov., 97-99 (1976) 
Matar, KM, Rajagopal, C: On the Hindu quadrature of the circle. J. Bombay Branch R. Asiat. Soc. 20, 77-82 (1944) 
Mikami, Y: The Development of Mathematics in China and Japan. Chelsea, New York (1913) 
Miel, G: An algorithm for the calculation of a. Am. Math. Mon. 86, 694-697 (1979) 
Miel, G: Of calculations past and present: the Archimedean algorithm. Am. Math. Mon. 90, 17-35 (1983) 
Moakes, AJ: The calculation of zr. Math. Gaz. 54, 261-264 (1970) 
Moakes, AJ: A further note on machine computation for a. Math. Gaz. 55, 306-310 (1971) 
Mortici, C: Refinement of Gurland’s formula for pi. Comput. Math. Appl. 62, 2616-2620 (2011) 
Myers, WA: The Quadrature of the Circle, the Square Root of Two, and the Right-Angled Triangle. Wilstach, Baldwin & 
Co. Printers, Cincinnati (1873) 

Nagell, T: Irrationality of the numbers e and zr. In: Introduction to Number Theory, pp. 38-40. Wiley, New York (1951) 
Nakamura, K: On the sprout and setback of the concept of mathematical ‘proof’ in the Edo period in Japan: 
regarding the method of calculating number 7. Hist. Sci. 3, 185-199 (1994) 

Nanjundiah, TS: On Huygens’ approximation to zr. Math. Mag. 44, 221-223 (1971) 

Newman, M, Shanks, D: On a sequence arising in series for 2. Math. Comput. 42, 199-217 (1984) 

Nicholson, SC, Jeenel, J: Some comments on a NORC computation of zr. Math. Tables Other Aids Comput. 9, 
162-164 (1955) 

Niven, IM: A simple proof that zr is irrational. Bull. Am. Math. Soc. 53, 507 (1947) 

Niven, IM: Irrational Numbers. Wiley, New York (1956) 


Page 57 of 59 


422 22. BIRTH, GROWTH AND COMPUTATION OF PI TO TEN TRILLION DIGITS (2013) 


Agarwal et al. Advances in Difference Equations 2013, 2013:100 
http://www.advancesindifferenceequations.com/content/2013/1/100 


149. 
150. 


151. 
152. 


153. 


154. 
155, 
156. 
157. 
158. 
159. 
160. 


161. 
162. 


163. 
164. 
165. 
166. 


167. 
168. 
169. 


170. 


171, 


172. 


173. 


174, 
175. 
176. 


177. 
178. 
179. 


180. 


181. 
182. 
183. 
184. 
185. 
186. 


187. 


188. 


189. 
190. 
191. 
192. 


193. 
194. 
195. 
196. 


197. 
198. 
199, 
200. 


alais, R: pi is wrong. Math. Intell. 23, 7-8 (2001) 

arker, JA: The Quadrature of the Circle: Setting Forth the Secrete Teaching of the Bible. Kessinger Publ., Whitefish 

(2010) 

ereira da Silva, C: A brief history of the number 7. Bol. Soc. Parana. Mat. 7, 1-8 (1986) 

louffe, S: Identities inspired from Ramanujan notebooks (Part 2). Apr. 2006. 

http://www.lacim.uqam.ca/~plouffe/inspired2.pdf 

osamentier, AS, Lehmann, |: Pi: A Biography of the World's Most Mysterious Number. Prometheus Books, New York 

(2004) 

reston, R: The mountains of zr. The New Yorker, March 2, 36-67 (1992) 

uritz, CW: An elementary method of calculating a. Math. Gaz. 58, 102-108 (1974) 

Qian, B: A study of a found in Chinese mathematical books. Kexue 8, 114-129 and 254-265 (1923) 

Rabinowitz, S, Wagon, S: A spigot algorithm for the digits of a. Am. Math. Mon. 102, 195-203 (1995) 

Rajagopal, CT, Vedamurti Aiyar, TV: A Hindu approximation to zr. Scr. Math. 18, 25-30 (1952) 

Ramanujan, S: Modular equations and approximations to mr. Q. J. Pure Appl. Math. 45(1914), 350-372 (1913-1914) 

Reitwiesner, G: An ENIAC determination of a and e to more than 2,000 decimal places. Math. Tables Other Aids 

Comput. 4, 11-15 (1950) 

Roy, R: The discovery of the series formula for z by Leibniz, Gregory, and Nilakantha. Math. Mag. 63, 291-306 (1990) 

Rutherford, W: Computation of the ratio of the diameter of a circle to its circumference to 208 places of figures. 
hilos. Trans. R. Soc. Lond. 131, 281-283 (1841) 

Sagan, C: Contact. Simon & Schuster, New York (1985) 

Salamin, E: Computation of zr using arithmetic-geometric mean. Math. Comput. 30, 565-570 (1976) 

Salikhov, V: On the irrationality measure of zr. Russ. Math. Surv. 53, 570-572 (2008) 

Schepler, HC: The chronology of PI. Math. Magazine, January-February 1950: 165-170; March-April 1950: 216-228; 

jay-June 1950: 279-283 

Schréder, EM: Zur irrationalitat von a? und zr. Mitt. Math. Ges. Hamb. 13, 249 (1993) 

Schubert, H: Squaring of the circle. Smithsonian Institution Annual Report (1890) 

Sen, SK, Agarwal, RP: Best k-digit rational approximation of irrational numbers: pre-computer versus computer era. 

Appl. Math. Comput. 199, 770-786 (2008) 

Sen, SK, Agarwal, RP: zr, e, @ with MATLAB: Random and Rational Sequences with Scope in Supercomputing Era. 

Cambridge Scientific Publishers, Cambridge (2011) 

Sen, SK, Agarwal, RP, Shaykhianb, GA: Golden ratio versus pi as random sequence sources for Monte Carlo 

integration. Math. Comput. Model. 48, 161-178 (2008) 

Sen, SK, Agarwal, RP, Shaykhian, GA: Best k-digit rational approximations-true versus convergent, decimal-based 

ones: quality, cost, scope. Adv. Stud. Contemp. Math. 19, 59-96 (2009) 

Sen, SK, Agarwal, RP, Pavani, R: Best k-digit rational bounds for irrational numbers: pre- and super-computer era. 

Math. Comput. Model. 49, 1465-1482 (2009) 

Shanks, D: Dihedral quartic approximations and series for zr. J. Number Theory 14, 397-423 (1982) 

Shanks, D, Wrench, JW Jr: Calculation of zt to 100,000 decimals. Math. Comput. 16, 76-99 (1962) 

Shanks, W: Contributions to Mathematics Comprising Chiefly the Rectification of the Circle to 607 Places of 
Decimals. Bell, London (1853) 

Shanks, W: On the extension of the numerical value of zr. Proc. R. Soc. Lond. 21, 315-319 (1873) 

Singmaster, D: The legal values of zr. Math. Intell. 7, 69-72 (1985) 

Smith, DE: History and transcendence of pi. In: Young, WJA (ed.) Monograms on Modern Mathematics. Longmans, 

Green, New York (1911) 

Smith, DE: The history and transcendence of zr. In: Young, JWA (ed.) Monographs on Topics of Modern Mathematics 
Relevant to the Elementary Field, chapter 9, pp. 388-416. Dover, New York (1955) 

Smith, DE, Mikami, Y: A History of Japanese Mathematics. Open-Court, Chicago (1914) 

van Roijen Snell, W: Cyclometricus. Leiden (1621) 

Sondow, J: A faster product for a and a new integral for In(z/2). Am. Math. Mon. 112, 729-734 (2005) 

Stern, MD: A remarkable approximation to a. Math. Gaz. 69, 218-219 (1985) 

Stevens, J: Zur irrationalitat von zr. Mitt. Math. Ges. Hamb. 18, 151-158 (1999) 

Stérmer, C: Sur l'application de la théorie des nombres entiers complexes 4 la solution en nombres rationnels 

X1)Xd)00 0 Xe C1) C2) +++, Cn, k de l@quation c, arctan x; + cp arctan x2 +-+++c¢, arctan x, = kp/4. Arch. Math. 
Naturvidensk. 19, 75-85 (1896) 

Takahasi, D, Kanada, Y: Calculation of to 51.5 billion decimal digits on distributed memory and parallel processors. 

Trans. Inf. Process. Soc. Jpn. 39(7) (1998) 

Tamura, Y, Kanada, Y: Calculation of to 4,194,293 decimals based on Gauss-Legendre algorithm. Technical report 
83-01, Computer Center, University of Tokyo (1982) 

Todd, J: A problem on arctangent relations. Am. Math. Mon. 56, 517-528 (1949) 

Trier, PE: Pi revisited. Bull. - Inst. Math. Appl. 25, 74-77 (1989) 

Tweddle, |: John Machin and Robert Simson on inverse-tangent series for mr. Arch. Hist. Exact Sci. 42, 1-14 (1991) 
Uhler, HS: Recalculation and extension of the modulus and of the logarithms of 2, 3,5, 7 and 17. Proc. Natl. Acad. Sci. 
USA 26, 205-212 (1940) 

Vega, G: Thesaurus Logarithmorum Completus. Leipzig (1794) 

Viéta, F: Uriorum de rebus mathematicis responsorum. Liber VII (1593) 

Volkov, A: Calculations of z in ancient China: from Liu Hui to Zu Chongzhi. Hist. Sci. 4, 139-157 (1994) 

Volkov, A: Supplementary data on the values of zr in the history of Chinese mathematics. Philos. Hist. Sci. Taiwan. J. 
3, 95-120 (1994) 

Volkov, A: Zhao Yougin and his calculation of zw. Hist. Math. 24, 301-331 (1997) 

Wagon, S: Is z normal. Math. Intell. 7, 65-67 (1985) 

Wells, D: The Penguin Dictionary of Curious and Interesting Numbers. Penguin, Middlesex (1986) 

Wrench, JW Jr: The evolution of extended decimal approximations to 7. Math. Teach. 53, 644-650 (1960) 


Page 58 of 59 


22. BIRTH, GROWTH AND COMPUTATION OF PI TO TEN TRILLION DIGITS (2013) 423 


Agarwal et al. Advances in Difference Equations 2013, 2013:100 
http://www.advancesindifferenceequations.com/content/2013/1/100 


201. 


202. 
203. 


204. 


205. 
206. 
207. 
208. 
209. 
210. 


Wrench, JW Jr, Smith, LB: Values of the terms of the Gregory series for arccot 5 and arccot 239 to 1,150 and 1,120 
decimal places, respectively. Math. Tables Other Aids Comput. 4, 160-161 (1950) 
Yeo, A: The Pleasures of zr, e and Other Interesting Numbers. World Scientific, Singapore (2006) 


Zebrowski, E: A History of the Circle: Mathematical Reasoning and the Physical Universe. Rutgers University Press, 


Pisacataway (1999) 

Zha, Y-L: Research on Tsu Ch’'ung-Chih's approximate method for zr. In: Science and Technology in Chinese 
Civilization, pp. 77-85. World Scientific, Teaneck (1987) 

http://mathworld.wolfram.com/PiFormulas.html 

http://en.wikipedia.org/wiki/Pi 

http://en.wikipedia.org/wiki/Pi_approximations 

en.wikipedia.org/wiki/Negative_base 

en.wikipedia.org/wiki/Super_Pi 

en.wikipedia.org/wiki/Overclocking 


doi:10.1186/1687-1847-2013-100 
Cite this article as: Agarwal et al.: Birth, growth and computation of pi to ten trillion digits. Advances in Difference 
Equations 2013 2013:100. 


Submit your manuscript to a SpringerOpen® 
journal and benefit from: 


> Convenient online submission 

> Rigorous peer review 

> Immediate publication on acceptance 

> Open access: articles freely available online 
> High visibility within the field 

> Retaining the copyright to your article 


Submit your next manuscript at > springeropen.com 


Page 59 of 59 


23. Pi day is upon us again and we still do not 
know if pi is normal (2014) 


Paper 23: David H. Bailey and Jonathan Borwein, “Pi day is upon us again and 
we still do not know if pi is normal,” American Mathematical Monthly, vol. 121 
(2014), p. 191-206. Copyright 2014 Mathematical Association of America. All 
Rights Reserved. 


Synopsis: 

This paper, which appeared, appropriately enough, on Pi day (March 14, or 
3/14 in North American notation) of 2014, discusses the enduring appeal of 7 in 
both the popular media and also in serious state-of-the-art mathematical research. 

The article provides a brief review of the origins of decimal arithmetic, the 
original formulas found to compute 7, recent formulas by Ramanujan, Salamin, 
Brent and others, the BBP formula for computing binary digits of 7 beginning at 
an arbitrary starting position, and new computer-based techniques for analyzing 
the digits of 7, such as the graphical techniques discussed in paper #21 of this 
collection, recent results on normality and, in conclusion, a list of unanswered 
questions regarding 7. 
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Pi Day Is Upon Us Again and We Still Do Not 
Know if Pi Is Normal 


David H. Bailey and Jonathan Borwein 


Abstract. The digits of z have intrigued both the public and research mathematicians from 
the beginning of time. This article briefly reviews the history of this venerable constant, and 
then describes some recent research on the question of whether z is normal, or, in other words, 
whether its digits are statistically random in a specific sense. 


1. PI AND ITS DAY IN MODERN POPULAR CULTURE. The number z, 
unique among the pantheon of mathematical constants, captures the fascination both 
of the public and of professional mathematicians. Algebraic constants such as /2 are 
easier to explain and to calculate to high accuracy (e.g., using a simple Newton itera- 
tion scheme). The constant e is pervasive in physics and chemistry, and even appears 
in financial mathematics. Logarithms are ubiquitous in the social sciences. But none 
of these other constants has ever gained much traction in the popular culture. 

In contrast, we see z at every turn. In an early scene of Ang Lee’s 2012 movie 
adaptation of Yann Martel’s award-winning book The Life of Pi, the title character 
Piscine (“Pi”) Molitor writes hundreds of digits of the decimal expansion of z on a 
blackboard to impress his teachers and schoolmates, who chant along with every digit.! 
This has even led to humorous take-offs such as a 2013 Scott Hilburn cartoon entitled 
“Wife of Pi,” which depicts a 4 figure seated next to a z figure, telling their marriage 
counselor “‘He’s irrational and he goes on and on.” [22]. 

This attention comes to a head on March 14 of each year with the celebration of “Pi 
Day,” when in the United States, with its taste for placing the day after the month, 3/14 
corresponds to the best-known decimal approximation of Pi (with 3/14/15 promis- 
ing a gala event in 2015). Pi Day was originally founded in 1988, the brainchild of 
Larry Shaw of San Francisco’s Exploratorium (a science museum), which in turn 
was founded by Frank Oppenheimer (the younger physicist brother of Robert Op- 
penheimer) after he was blacklisted by the U.S. Government during the McCarthy era. 

Originally a light-hearted gag where folks walked around the Exploratorium in 
funny hats with pies and the like, by the turn of the century Pi Day was a major educa- 
tional event in North American schools, garnering plenty of press.” In 2009, the U.S. 
House of Representatives made Pi Day celebrations official by passing a resolution 
designating March 14 as “National Pi Day,” and encouraging “schools and educators 
to observe the day with appropriate activities that teach students about Pi and engage 
them about the study of mathematics.” [23].* 

As a striking example, the March 14, 2007 New York Times crossword puzzle fea- 
tured clues, where, in numerous locations, a (standing for PI) must be entered at the 


http://dx.doi.org/10.4169/amer.math.monthly.121.03.191 
MSC: Primary 01A99, Secondary 11Z05 
'Good scholarship requires us to say that in the book Pi contents himself with drawing a circle of unit 
diameter. 
?Try www. google.com/trends?q=Pi+ to see the seasonal interest in ‘Pi’. 
3This seems to be the first legislation on Pi to have been adopted by a government, though in the late 19th 
century Indiana came embarrassingly close to legislating its value, see [12, Singmaster, Entry 27] and [14]. 
This MONTHLY played an odd role in that affair. 
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intersection of two words. For example, 33 across “Vice president after Hubert” (an- 
swer: Sz1RO) intersects with 34 down “Stove feature” (answer: 2 LOT). Indeed, 28 
down, with clue “March 14, to mathematicians,” was, appropriately enough, PIDAY, 
while PIPPIN is now a four-letter word. The puzzle and its solution are reprinted with 
permission in [15, pp. 312-313]. 


x Mania in popular culture. Many instances are given in [14]. They include the 
following: 


1. On September 12, 2012, five aircraft armed with dot-matrix-style skywriting 
technology wrote 1000 digits of z in the sky above the San Francisco Bay area 
as a spectacular and costly piece of piformance art. 


2. On March 14, 2012, U.S. District Court Judge Michael H. Simon dismissed a 
copyright infringement suit relating to the lyrics of a song by ruling that “Pi is a 
non-copyrightable fact.” 


3. On the September 20, 2005 edition of the North American TV quiz show Jeop- 
ardy!, in the category “By the numbers,” the clue was “‘How I want a drink, 
alcoholic of course’ is often used to memorize this.” (Answer: What is Pi?). 


4. On August 18, 2005, Google offered 14,159,265 “new slices of rich technology” 
in their initial public stock offering. On January 29, 2013 they offered a z million 
dollar prize for successful hacking of the Chrome Operating System on a specific 
Android phone. 

5. In the first 1999 Matrix movie, the lead character Neo has only 314 seconds to 
enter the Source. Time noted the similarity to the digits of z. 

6. The 1998 thriller “Pi” received an award for screenplay at the Sundance film fes- 
tival. When the authors were sent advance access to its website, they diagnosed 
it a fine hoax. 


7. The May 6, 1993 edition of The Simpsons had Apu declaring “I can recite pi to 
40,000 places. The last digit is 1.’ This digit was supplied to the screenwriters 
by one of the present authors. 

8. In Carl Sagan’s 1986 book Contact, the lead character (played by Jodie Foster 
in the movie) searched for patterns in the digits of 2, and after her mysterious 
experience sought confirmation in the base-11 expansion of z. 


With regards to item #3 above, there are many such “pi-mnemonics” or “piems” 
(i.e., phrases or verse whose letter count, ignoring punctuation, gives the digits of 7) 
in the popular press [12, 14]. Another is “Sir, I bear a rhyme excelling / In mystic force 
and magic spelling / Celestial sprites elucidate / All my own striving can’t relate.” [13, 
p. 106]. Some are very long [12, Keith, Entry 59, pp. 560-561]. 

Sometimes the attention given to z is annoying, such as when on August 14th, 2012, 
the U.S. Census Office announced the population of the country had passed exactly 
314,159,265. Such precision was, of course, completely unwarranted. Sometimes the 
attention is breathtakingly pleasurable.*'> 


Poems versus piems. While piems are fun they are usually doggerel. To redress this, 
we include examples of excellent z poetry and song.° In Figure 1 we present the much 
anthologised poem “PI,” by Polish poet Wislawa Szymborska (1923-2012) who won 


4See the 2013 movie at http://www. youtube. com/watch?v=Vp9zLbIE8zo. 

5A comprehensive Pi Day presentation is lodged at http: //www.carma.newcastle.edu.au/jon/ 
piday.pdf. 

®See also [12, Irving Kaplansky’s “A song about Pi.”]. 
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the 1996 Novel prize for literature [29, p. 174]. In Figure 2 we present the lyrics of 
“Pi” by the influential British singer songwriter Kate Bush [18]. The Observer review 
of her 2005 collection Aerial, on which the song appears, wrote that it is 


a sentimental ode to a mathematician, audacious in both subject matter and treat- 
ment. The chorus is the number sung to many, many decimal places.’ 


The admirable number pi: 

three point one four one. 

All the following digits are also just a start, 

five nine two because it never ends. 

It can’t be grasped, six five three five, at a glance, 

eight nine, by calculation, 

seven nine, through imagination, 

or even three two three eight in jest, or by comparison 
four six to anything 

two six four three in the world. 

The longest snake on earth ends at thirty-odd feet. 
Same goes for fairy tale snakes, though they make it a little longer. 
The caravan of digits that is pi 

does not stop at the edge of the page, 

but runs off the table and into the air, 

over the wall, a leaf, a bird’s nest, the clouds, straight into the sky, 
through all the bloatedness and bottomlessness. 

Oh how short, all but mouse-like is the comet’s tail! 
How frail is a ray of starlight, bending in any old space! 
Meanwhile two three fifteen three hundred nineteen 
my phone number your shirt size 

the year nineteen hundred and seventy-three sixth floor 
number of inhabitants sixty-five cents 

hip measurement two fingers a charade and a code, 

in which we find how blithe the trostle sings! 

and please remain calm, 

and heaven and earth shall pass away, 

but not pi, that won’t happen, 

it still has an okay five, 

and quite a fine eight, 

and all but final seven, 

prodding and prodding a plodding eternity 

to last. 


Figure 1. “PI,” by Wislawa Szymborska 


2. PRE-DIGITAL HISTORY. z is arguably the only mathematical topic from 
very early history that is still being researched today. The Babylonians used the 
approximation z ~ 3. The Egyptian Rhind Papyrus, dated roughly 1650 BCE, sug- 
gests 7 = 32/18 = 3.16049.... Early Indian mathematicians believed 7 = /10 = 
3.162277.... Archimedes, in the first mathematically rigorous calculation, employed 
a clever iterative construction of inscribed and circumscribed polygons to establish that 


3 10/71 = 3.14084...< am <3 1/7 = 3.14285... 


7She sings over 150 digits but errs after 50 places. The correct digits occurred with the published lyrics. 
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Sweet and gentle sensitive man 

With an obsessive nature and deep fascination 
For numbers 

And a complete infatuation with the calculation 
Of PI 


Oh he love, he love, he love 

He does love his numbers 

And they run, they run, they run him 
In a great big circle 

In a circle of infinity 


3.1415926535 897932 
3846 264 338 3279 


Oh he love, he love, he love 

He does love his numbers 

And they run, they run, they run him 
In a great big circle 

In a circle of infinity 

But he must, he must, he must 

Put a number to it 


50288419 716939937510 
582319749 44 59230781 
6406286208 821 4808651 32 


Oh he love, he love, he love 

He does love his numbers 

And they run, they run, they run him 
In a great big circle 

In a circle of infinity 


82306647 0938446095 505 8223... 
Figure 2. “Pi,” by Kate Bush 


This amazing work, done without trigonometry or floating point arithmetic, is charm- 
ingly described by George Phillips [12, Entry 4]. 


Life after modern arithmetic. The advent of modern positional, zero-based decimal 
arithmetic, most likely discovered in India prior to the fifth century [4, 27], signifi- 
cantly reduced computational effort. Even though the Indo-Arabic system, as it is now 
known, was introduced to Europeans first by Gerbert of Aurillac (c. 946-1003, who 
became Pope Sylvester II in 999) in the 10th century, and again, in greater detail and 
more successfully, by Fibonacci in the early 13th century, Europe was slow to adopt 
it, hampering progress in both science and commerce. In the 16th century, prior to the 
widespread adoption of decimal arithmetic, a wealthy German merchant was advised, 
regarding his son’s college plans, 


If you only want him to be able to cope with addition and subtraction, then 
any French or German university will do. But if you are intent on your son going 
on to multiplication and division—assuming that he has sufficient gifts—then 
you will have to send him to Italy. [24, p. 577] 


Life after calculus. Armed with decimal arithmetic and modern calculus, 17th-, 
18th-, and 19th-century mathematicians computed z with aplomb. Newton recorded 
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16 digits in 1665, but later admitted, “I am ashamed to tell you how many figures I 
carried these computations, having no other business at the time.” In 1844 Dase, under 
the guidance of Strassnitzky, computed 212 digits correctly in his head [14]. These 
efforts culminated with William Shanks (1812-1882), who employed John Machin’s 


formula 
4 1 1 
— = Aarctan | — } — arctan | —— ], (1) 
4 5 239 


where arctanx = x — x°/3 + «5/5 — x7/7+ x°/9 —---, to compute 707 digits in 
1874. His 1853 work to 607 places was funded by 30 subscriptions from such notables 
as Rutherford, De Morgan (two copies), Herschel (Master of the Mint and son of the 
astronomer) and Airy.® 

Alas, only 527 digits were correct (as Ferguson found nearly a century later in 1946 
using a calculator), confirming the suspicions of De Morgan at the time, who asserted 
that there were too many sevens in Shanks’ published result (although the statistical 
deviation was not as convincing as De Morgan thought [26]). A brief summary of this 
history is shown in Table 1. We note that Sharp was a cleric, Ferguson was a school 
teacher, and Dase a “kopfrechnenner.” Many original documents relating to this history 
can be found in [12]. 


Table 1. Brief chronicle of pre-20th-century z calculations 


Archimedes 250? BCE 3 3.1418 (ave.) 
Liu Hui 263 5 3.14159 

Tsu Ch’ung Chi 480? 7 3.1415926 
Al-Kashi 1429 14 

Romanus 1593 15 

Van Ceulen 1615 39 (35 correct) 
Newton 1665 16 

Sharp 1699 71 

Machin 1706 100 

De Lagny 1719 127 (112 correct) 
Vega 1794 140 

Rutherford 1824 208 (152 correct) 
Strassnitzky and Dase 1844 200 

Rutherford 1853 440 

Shanks 1853 607 (527 correct) 
Shanks 1873 707 (527 correct) 


Mathematics of Pi. Alongside these numerical developments, the mathematics be- 
hind z enjoyed comparable advances. In 1761, using improper continued fractions, 
Lambert [12, Entry 20] proved that z is irrational, thus establishing that the digits of 
m never repeat. Then in 1882, Lindemann [12, Entry 22] proved that e® is transcen- 
dental for every nonzero algebraic number a, which immediately implied that z is 
transcendental (since e’” = —1). This result settled in decisive terms the 2000-year- 
old question of whether a square could be constructed with the same area as a circle, 
using compass and straightedge (it cannot, because if it could then z would be a geo- 
metrically constructible number and hence algebraic). 


8He had originally intended to present only about 500 places, and evidently added the additional digits while 
finishing the galleys a few months later [12, Entry 20]. Errors introduced in a rush to publish are not new. 
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3. THE TWENTIETH CENTURY AND BEYOND. With the development of 
computer technology in the 1950s and 1960s, 2 was computed to thousands of digits, 
facilitated in part by new algorithms for performing high-precision arithmetic, notably 
the usage of fast Fourier transforms to dramatically accelerate multiplication. 


Ramanujan-type series for 1/m. Even more importantly, computations of 2 began 
to employ some entirely new mathematics, such as Ramanujan’s 1914 formula 


1 2/2 > (4k)! (1103 + 26390k) (2) 


x 9801 (k!)4 396% ; 


k=0 


each term of which produces an additional eight correct digits in the result [16]. David 
and Gregory Chudnovsky employed the variant 


1 *\ (—1)* (6k)! (13591409 + 545140134k 
Loy et - ) a 
k=0 


(3k)! (k!)3 64032074+3/2 


each term of which adds 14 correct digits. Both of these formulas rely on rather deep 
number theory [14] and related modular-function theory [16]. 


Reduced complexity algorithms [17] for 1/2. Another key development in the mid 
1970s was the Salamin—Brent algorithm (12, Entries 46 and 47] for a: Set ao = 
1, by = 1/V2, and so = 1/2. Then for k > 1, iterate 


_ by_ 
ace i TY 


2 


ayia pe = 9k _ 2a; 4 
Ch = a, — b; Sk = Spy — 2°Cx a (4) 
k 


The value of p, converges quadratically to m—each iteration approximately doubles 
the number of correct digits. 

A related algorithm, inspired by a 1914 Ramanujan paper, was found in 1986 by 
one of us and Peter Borwein [16]: Set ay = 6 — 4/2 and yo = J/2 — 1. Then for 
k > 0, iterate 


ie ad _ yt l4 

kt = ——__*__ 4 (5) 
1+(+y," 

Ae = aL + yep) — 2" yen + yee + Ye): 


Then a, converges quartically to 1/m—each iteration approximately quadruples the 
number of correct digits. Just twenty-one iterations suffices to produce an algebraic 
number that agrees with z to more than six trillion digits (provided all iterations are 
performed with this precision). 

With discoveries such as these, combined with prodigious improvements in com- 
puter hardware (thanks to Moore’s Law) and clever use of parallelism, 2 was computed 
to millions, then billions, and, in 2011, to 10 trillion decimal digits. A brief chronicle 
of x computer-age computations is shown in Table 2.” 


°Tt is probably unnecessary to note that the Shanks of this table is not the Shanks of Table 1. 
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Table 2. Brief chronicle of computer-age z calculations 


Ferguson 1945 620 
Smith and Wrench 1949 1,120 
Reitwiesner et al. (ENIAC) 1949 2,037 
Guilloud 1959 16,167 
Shanks and Wrench 1961 100,265 
Guilloud and Bouyer 1973 1,001,250 
Kanada, Yoshino and Tamura 1982 16,777,206 
Gosper 1985 17,526,200 
Bailey Jan. 1986 29,360,111 
Kanada and Tamura Jan. 1988 201,326,551 
Kanada and Tamura Nov. 1989 1,073,741,799 
David and Gregory Chudnovsky Aug. 1991 2,260,000,000 
Kanada and Takahashi Apr. 1999 51,539,600,000 
Kanada and Takahashi Sep. 1999 206, 158,430,000 
Kanada and 9 others Nov. 2002 1,241,100,000,000 
Bellard Dec. 2009 2,699,999,990,000 
Kondo and Yee Aug. 2010 5,000,000,000,000 
Kondo and Yee Oct. 2011 10,000,000,000,000 


4. COMPUTING DIGITS OF x AT AN ARBITRARY STARTING POSITION. 
A recent reminder of the folly of thinking that wz is fully understood was the 1996 
discovery of a simple scheme for computing binary or hexadecimal digits of 2, begin- 
ning at an arbitrary starting position, without needing to compute any of the preceding 
digits. This scheme is based on the following formula, which was discovered by a 
computer program implementing Ferguson’s “PSLQ” algorithm [9, 20]: 


ioe) 


y 1 4 2 1 1 6) 
r= ; 
= AGF 8kK+1 8k+4 8k+5 8k+6 


The proof of this formula (now known as the “BBP” formula for 7) is a relatively sim- 
ple exercise in calculus. It is perhaps puzzling that it had not been discovered centuries 
before. But then no one was looking for such a formula. 


How bits are extracted. The scheme to compute digits of 2 beginning at an arbi- 
trary starting point is best illustrated by considering the similar (and very well known) 
formula for log 2: 


ess (7) 


Note that the binary expansion of log 2 beginning at position d + | is merely the frac- 
tional part of 2 log 2, so that we can write (where {-} denotes fractional part): 


d_ »d-k oo d—k 
otto} = | [> Feet reath «| yo =|]. (8) 
k=1 k=d+1 


Now note that the numerators of the first summation can be computed very rapidly 
by means of the binary algorithm for exponentiation, namely the observation, for ex- 
ample, that 3!7 mod 10 = ((((3? mod 10)? mod 10)* mod 10)? mod 10) - 3 mod 10. 
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This same approach can be used to compute binary or hexadecimal digits of 7 us- 
ing (6). 

This scheme has been implemented to compute hexadecimal digits of 7 begin- 
ning at stratospherically high positions. In July 2010, for example, Tsz-Wo Sze 
of Yahoo! Cloud Computing computed base-16 digits of a beginning at position 
2.5 x 10'4. Then on March 14 (Pi Day), 2013, Ed Karrels of Santa Clara University 
computed 26 base-16 digits beginning at position one quadrillion [25]. His result: 
8353CB3F7FOC9OACCFA9AA2 15F2. 


Beyond utility. Certainly, there is no need for computing z to millions or billions of 
digits in practical scientific or engineering work. A value of z to 40 digits is more than 
enough to compute the circumference of the Milky Way galaxy to an error less than 
the size of a proton. There are certain scientific calculations that require intermediate 
calculations to be performed to higher than standard 16-digit precision (typically 32 or 
64 digits may be required) [3], and certain computations in the field of experimental 
mathematics have required as high as 50,000 digits [6], but we are not aware of any 
“practical” applications beyond this level. 

Computations of digits of 2 are, however, excellent tests of computer integrity— 
if even a single error occurs during a large computation, almost certainly the final 
result will be in error, resulting in disagreement with a check calculation done with 
a different algorithm. For example, in 1986, a pair of z-calculating programs using 
(4) and (5) detected some obscure hardware problems in one of the original Cray- 
2 supercomputers.!° Also, some early research into efficient implementations of the 
fast Fourier transform on modern computer architectures had their origins in efforts to 
accelerate computations of z [2]. 


5. NEW TECHNIQUES TO EXPLORE NORMALITY AND RELATED PROP- 
ERTIES. Given an integer b > 2, a real number a is said to be b-normal or normal 
base b if every m-long string of base-b digits appears in the base-b expansion of a 
with limiting frequency 1/b’". It is easy to show via measure theory that almost all real 
numbers are b-normal for every b > 2 (a condition known as absolute normality), but 
establishing normality for specific numbers has proven to be very difficult. 

In particular, no one has been able to establish that z is b-normal for any integer b, 
much less for all bases simultaneously. It is a premier example of an intriguing mathe- 
matical question that has occurred to countless schoolchildren as well as professional 
mathematicians through the ages, but which has defied definitive answer to the present 
day. A proof for any specific base would not only be of great interest worldwide, but 
would also have potential practical application as a provably effective pseudorandom 
number generator. This ignorance extends to other classical constants of mathematics, 
including e, log2, 2, and y (Euler’s constant). Borel conjectured that all irrational 
algebraic numbers are absolutely normal, but this has not been proven in even a single 
instance, to any base. 

Two examples where normality has been established are Champernowne’s number 
Cio = 0.12345678910111213 ... (constructed by concatenating the positive integers), 
which is provably 10-normal, and Stoneham’s number 2,3 = )>;.9 1/(3*2"), which 
is provably 2-normal—see below [10, 11, 28]. One relatively weak result for algebraic 
numbers is that the number of |-bits in the binary expansion of a degree-D algebraic 
number a must exceed Cn!/” for all sufficiently large n, for a positive number C that 


!0Cray’s own tests did not find these errors. After that, these z algorithms were included in Cray’s test suite 
in the factory. 
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depends on a [8]. Thus, for example, the number of 1-bits in the first n bits of the 
binary expansion of 2 must exceed ./n. 

In spite of these intriguing developments, it is clear that more powerful techniques 
must be brought to bear on the question of normality, either for 7 or other well-known 
constants of mathematics, before significant progress can be achieved. Along this line, 
modern computer technology suggests several avenues of research. 


Statistical analysis. One approach is simply to perform large-scale statistical analyses 
on the digits of 2, as has been done, to some degree, on nearly all computations since 
ENIAC. In [7], for example, the authors empirically tested the normality of its first 
roughly four trillion hexadecimal (base-16) digits using a Poisson process model, and 
concluded that, according to this test, it is “extraordinarily unlikely” that zr is not 16- 
normal (of course, this result does not pretend to be a proof). 


Graphical representations. Another fruitful approach is to display the digits of 2 or 
other constants graphically, cast as a random walk [1]. For example, Figure 3 shows a 
walk based on one million base-4 pseudorandom digits, where at each step the graph 
moves one unit east, north, west, or south, depending on the whether the pseudoran- 
dom iterate at that position is 0, 1, 2, or 3. The color indicates the path followed by the 
walk—shifted up the spectrum (red-orange-yellow-green-cyan-blue-purple-red) fol- 
lowing an HSV scheme with S and V equal to one. The HSV (hue, saturation, and 
value) model is a cylindrical-coordinate representation that yields a rainbow-like range 
of colors. 


Figure 3. A uniform pseudorandom walk 


Figure 4 shows a walk on the first 100 billion base-4 digits of x. This may be viewed 
dynamically in more detail online at http: //gigapan.org/gigapans/106803, 
where the full-sized image has a resolution of 372,224 x 290,218 pixels (108.03 billion 
pixels in total). This is one of the largest mathematical images ever produced and, 
needless to say, its production was by no means easy [1]. 
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Figure 4. A walk on the first 100 billion base-4 digits of z 


Although no clear inference regarding the normality of z can be drawn from these 
figures, it is plausible that z is 4-normal (and thus 2-normal), since the overall ap- 
pearance of its graph is similar to that of the graph of the pseudorandomly generated 
base-4 digits. 


The Champernowne numbers. We should emphasize what a poor surrogate for ran- 
domness the notion of normality actually is. The base-b Champernowne number, C,j, is 
formed by concatenating the natural numbers base b as a floating-point number in that 
base. It was the first type of number proven to be normal and fails stronger normality 
tests [1]. Thus, 


a 
= by m(b — Don! 


k=1 


ze 
o 
| 


= 0.123456789101112... 


C4 = 0.1231011121320212223...4. (9) 


In Figure 5 we show how far from random a walk on a normal number may be— 
pictorially or by many quantitative measures—as illustrated by C4.!! 


Stoneham numbers. This same tool can be employed to study the digits of Stone- 
ham’s constant, namely 


“1 
02,3 = = 3k 3k ° oy 

k=0 
This constant is one of the few that is provably 2-normal (and thus 2”-normal, for every 
positive integer n) [10, 11, 28]. What’s more, it is provably not 6-normal, so that it is 


' The subscript four denotes a base-four representation. 


200 © THE MATHEMATICAL ASSOCIATION OF AMERICA [Monthly 121 


23. PI DAY AGAIN AND WE STILL DO NOT KNOW IF PI IS NORMAL (2014) 


Figure 5. A walk on Champernowne’s base-4 number 


an explicit example of the fact that normality in one base does not imply normality 
in another base [5]. For other number bases, including base 3, its normality is not yet 
known one way or the other. 

Figures 6, 7, and 8 show walks generated from the base-3, base-4, and base-6 digit 
expansions, respectively, of a2,3. The base-4 digits are graphed using the same scheme 
mentioned above, with each step moving east, north, west, or south, according to 
whether the digit is 0, 1,2, or 3. The base-3 graph is generated by moving unit dis- 
tance at an angle 0, 2/3, or 27/3, respectively, for 0, 1, or 2. Similarly, the base-6 
graph is generated by moving unit distance at angle kw/6 fork = 0,1,...,5. 


Figure 6. A walk on the base-3 digits of Stoneham’s constant (@2,3) 
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Figure 7. A walk on the provably normal base-4 digits of Stoneham’s constant (2,3) 


ad if 


Figure 8. A walk on the abnormal base-6 digits of Stoneham’s constant (@2,3) 


From these three figures, it is clear that while the base-3 and base-4 graphs appear 
to be plausibly random (since they are similar in overall structure to Figures 3 and 4), 
the base-6 walk is vastly different, mostly a horizontal line. Indeed, we discovered the 
fact that a,3 fails to be 6-normal by a similar empirical analysis of the base-6 digits— 
there are long stretches of zeroes in the base-6 expansion [5]. Results of this type are 
given in [1] for numerous other constants besides 7, both “man-made” and “natural.” 

Such results certainly do not constitute formal proofs, but they do suggest, of- 
ten in dramatic form, as we have seen, that certain constants are not normal, and 
can further be used to bound statistical measures of randomness. For example, re- 
markable structure was uncovered in the normal Stoneham numbers [1]. Moreover, 
many related quantitative measures of random walks were examined, as were other 
graphical representations. Much related information, including animations, is stored at 
http://carma.newcastle.edu.au/walks/. 


6. OTHER UNANSWERED QUESTIONS. 


Mathematical questions. There are, of course, numerous other unanswered mathe- 
matical questions that can be posed about z. 


1. Are the continued fraction terms of 2 bounded or unbounded? The continued 
fraction expansion provides information regarding how accurately z can be writ- 
ten as a fraction. 
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2. Is the limiting fraction of zeroes in the binary expansion of z precisely 1/2? 
Is the limiting fraction of zeroes in the decimal expansion precisely 1/10? We 
do not know the answers to these questions even for simple algebraic constants 
such as afd: much less zr. 


3. Are there infinitely many ones in the ternary expansion of z? Are there infinitely 
many sevens in the decimal expansion of 2? Sadly, we cannot definitively an- 
swer such basic questions one way or the other. 


Meta-mathematical questions. For that matter, there are numerous historical ques- 
tions that are worth asking, if only rhetorically. 


1. Why was z not known more accurately in ancient times? It could have been 
known to at least two-digit accuracy by making careful measurements with 
a rope. 


2. Why did Archimedes, in spite of his astonishing brilliance in geometry and cal- 
culus, fail to grasp the notion of positional, zero-based decimal arithmetic? This 
would have greatly facilitated his computations (and likely would have changed 
history as well). 


3. Why did Indian mathematicians fail to extend their system of decimal arithmetic 
for integers to decimal fractions? Decimal fraction notation was first developed 
in the Arabic world in the 12th century. They managed by scaling their results, 
but missed the obvious. 


4. Why did Gauss and Ramanujan fail to exploit their respective identities for 2? 
After all, the Salamin-Brent quadratically convergent algorithm for z is derived 
directly from some identities of Gauss, and other algorithms for z follow from 
(then largely unproven) formulas of Ramanujan. For that matter, why was the 
notion of an algorithm, fundamental in our computer age, so foreign to their 
way of thinking? 


5. Why did centuries of mathematicians fail to find the BBP formula for 27, namely 
formula (6), not to mention the associated “trick” for computing digits at an 
arbitrary starting position? After all, as mentioned above, it can be proven in just 
a few steps with freshman-level calculus. 


In any event, it is clear that modern computer technology has changed the game for 
zt. Modern systems are literally billions of times faster and more capacious than their 
predecessors when the present authors began their careers, and advances in software 
(such as fast Fourier transforms for high-precision numerical computation and sym- 
bolic computing facilities for algebraic manipulations) have improved computational 
productivity just as much as hardware improvements. 

And computers are no longer merely passive creatures. A computer program dis- 
covered the BBP formula for z, as well as similar formulas for numerous other 
constants. Other formulas for 2 have been discovered by computer in a similar way, 
using high-precision implementations of the PSLQ algorithm or related integer rela- 
tion algorithms. 


Two unproven facts. In some of these cases, such as the following two formulas, 
proofs remain elusive: 
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Ay Qar'(k) (1+ 14k + 76K? + 168k") 
= » Q2k+1 ; (11) 


2048 » 3 (he Di De 
(1); 12k 


(21 + 466k + 4340k? + 20632k* + 43680k*), (12) 


where, in the first (due to Gourevich in 2001), r(k) = 1/2 -3/4----- (2k — 1)/(2k), 
and, in the second (due to Cullen in 2010), the notation (x), = x(x + 1)(x + 
1)---(« +n — 1) is the Pochhammer symbol. 


7. CONCLUSION. The mathematical constant z has intrigued both the public and 
professional mathematicians for millennia. Countless facts have been discovered about 
z and published in the mathematical literature. But, as we have seen, much misunder- 
standing abounds. We must also warn the innocent reader to beware of mathematical 
terrorists masquerading as nice people, in their evil attempt to replace zm by t = 27 
(which is pointless in any event, since the binary expansion of Tt is the same as 7, 
except for a shift of the decimal point).'” 

Yet there are still very basic questions that remain unanswered, among them 
whether (and why) z is normal to any base. Indeed, why do the basic algorithms 
of arithmetic, implemented to compute constants such as 7, produce such random- 
looking results? And can we reliably exploit these randomness-producing features for 
benefit, say, as commercial-quality pseudorandom number generators? 

Other challenges remain as well. But the advent of the computer might at last give 
humankind the power to answer some of them. Will computers one day be smarter than 
human mathematicians? Probably not any time soon, but for now they are remarkably 
pleasant research assistants. 
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Haiku 2:57 


I will come to bed 
when all three numbers in the 
clock are prime ... again. 


—Submitted by Terry Trowbridge 
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Synopsis: 
This paper presents the panorama of 7 through the ages, with a brief summary 
of 7 from Archimedes and others of antiquity to Renaissance times and finally to 
the computer age. Significant mathematical detail is provided, including a reprise 
of Ivan Niven’s 1947 elegant proof that a is irrational, numerous formulas that 
have been used to compute 7, quadratically convergent algorithms for 7, the BBP 

formula and algorithm for 7 and various curiosities, such as the fact that 
ra (1—2)*a4 i 94° 4t3 
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= L 45 
Ine ae 3 tt 3 + 4t — 4arctant, 


from which one can prove the age-old approximation 
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1 Preamble: 7 and Popular Culture 


The desire to understand 7, the challenge, and originally the need, to calculate ever more accurate values of 7, 
the ratio of the circumference of a circle to its diameter, has challenged mathematicians—great and less great—for 
many centuries. It has also, especially recently, provided compelling examples of computational mathematics. 
7, uniquely in mathematics, is pervasive in popular culture and the popular imagination. 


I shall intersperse this largely chronological account of m’s mathematical status with examples of its ubiquity. 
More details will be found in the selected references at the end of the chapter—especially in Pi: a Source Book 
[9]. In [9] all material not otherwise referenced may be followed up upon, as may much other material, both 
serious and fanciful. Other interesting material is to be found in [21], which includes attractive discussions of 
topics such as continued fractions and elliptic integrals. 


Fascination with a is evidenced by the many recent popular books, television shows, and movies—even 
perfume—that have mentioned 7. In the 1967 Star Trek episode “Wolf in the Fold,” Kirk asks “Arent there 
some mathematical problems that simply cant be solved?’ And Spock ‘fries the brains’ of a rogue computer by 
telling it: “Compute to the last digit the value of Pi.” (Figure 1 illustrates how much more is now known, see 
also http://carma.newcastle.edu.au/piwalk.shtml.) The May 6, 1993 episode of The Simpsons has the 
character Apu boast “I can recite pi to 40,000 places. The last digit is one.” 


In November 1996, MSNBC aired a Thanksgiving Day segment about 7, including that scene from Star 
Trek and interviews with the present author and several other mathematicians at Simon Fraser University. The 
1997 movie Contact, starring Jodie Foster, was based on the 1986 novel by noted astronomer Carl Sagan. In 
the book, the lead character searched for patterns in the digits of 7, and after her mysterious experience found 
confirmation—that the universe had meaning— in the base-11 expansion of 7. The 1997 book The Joy of Pi [11] 
has sold many thousands of copies and continues to sell well. The 1998 movie entitled Pi began with decimal 
digits of 7 displayed on the screen. And in the 2003 movie Matrix Reloaded, the Key Maker warns that a door 
will be accessible for exactly 314 seconds, a number that Time speculated was a reference to 7. 


As a striking example, imagine the following excerpt from Yann Mandel’s 2002 Booker Prize winning novel 
Life of Pi being written about any other transcendental number: 


“My name is 
Piscine Molitor Patel 
known to all as Pi Patel. 


For good measure I added 

m= 3.14 
and I then drew a large circle which I sliced in two with a diameter, to evoke that basic lesson of 
geometry.” 


Equally, National Public Radio reported on April 12, 2003 that novelty automatic teller machine withdrawal 
slips, showing a balance of $314, 159.26, were hot in New York City. One could jot a note on the back and, 


1This paper is an updated and revised version of [13] and is made with permission of the editor. 
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apparently innocently, let the intended target be impressed by one’s healthy saving account. Scott Simon, the 
host, noted the close resemblance to 7. Correspondingly, according to the New York Times of August 18 2005, 
Google offered exactly “14,159,265 New Slices of Rich Technology” as the number of shares in its then new 
stock offering. Likewise, March 14 in North America has become 7 Day, since in the USA the month is written 
before the day (‘314’). In schools throughout North America, it has become a reason for mathematics projects, 
especially focussing on 7. 


In another sign of true legitimacy, on March 14, 2007 the New York Times published a crossword in which 
to solve the puzzle, one had first to note that the clue for 28 DOWN was “March 14, to Mathematicians,” to 
which the answer is PIDAY. Moreover, roughly a dozen other characters in the puzzle are pI—for example, the 
clue for 5 DOWN was “More pleased” with the six character answer HAPTER. The puzzle is reproduced in [14]. 


Other and more recent examples—including the US Congressional House Resolution 224 designating of 
National Pi Day in 2009—may be found at http://www.carma.newcastle.edu.au/jon/piday.pdf which is 
annually updated. 


It is hard to imagine e, y or log 2 playing the same role. A corresponding scientific example [4, p. 11] is 


A coded message, for example, might represent gibberish to one person and valuable information to 
another. Consider the number 14159265... Depending on your prior knowledge, or lack thereof, it is 
either a meaningless random sequence of digits, or else the fractional part of pi, an important piece 
of scienti c information. 


Again, a scientist can use 7 confident that it is part of shared societal knowledge—even if when pressed his 
definition might not be as good as Pi’s. 


For those who know The Hitchhiker’s Guide to the Galaxy, it is amusing that 042 occurs at the digits ending 
at the fifty-billionth decimal place in each of 7 and 1/m—thereby providing an excellent answer to the ultimate 
question of ‘life, the universe and everything‘, which was “What is forty two?” A more intellectual offering is 
“The Deconstruction of Pi” given by Umberto Eco on page three of his 1988 book Foucault s Pendulum, {9, p. 
658]. The title says it all. 


Pi. Our central character 
mT = 3.14159265358979323... 


is traditionally defined in terms of the area or perimeter of a unit circle, but see Figure 4 where the subtlety 
of showing the two are the same is illustrated. The notation of 7 itself was introduced by William Jones in 
1737, replacing ‘p’ and the like, and was popularized by Leonhard Euler who is responsible for much modern 
nomenclature. A more formal modern definition of 7 uses the first positive zero of the sine function defined as 
a power series. The first thousand decimal digits of 7 are recorded in Figure 2. 


Despite continuing rumours to the contrary, 7 is not equal to 22/7 (see End Note 1). Of course 22/7 is one 
of the early continued fraction approximations to 7. The first six convergents are 


22 333 355 103993 104348 
* 7? 106’ 113’ 33102’ 33215 ° 


The convergents are necessarily good rational approximations to 7. The sixth differs from 7 by only 3.311071°. 


The corresponding simple continued fraction starts 
= (8; 7,.15,1;:292,.1,151, 2; 1,3) 1,14,2,.1,1, 2,2; 25251, 84, 251, 15.4], 


using the standard concise notation. This continued fraction is still very poorly understood. Compare that for 
e which starts 
e = [2,1,2,1,1,4,1,1,6,1,1,8,1,1,10,1,1,12,1,1,14,1,1,16,1,1,18,...]. 
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Figure 1: A 100 billion step planar walk on the binary digits of 7. Colors change from red and orange to violet 
and indigo as we proceed (snapshot taken from image in [1]) 


3. 141592653589793238462643383279502884197 169399375 1058209749445923078164062862089986280348253421170679 
821480865 1328230664709384460955058223172535940812848111745028410270193852110555964462294895493038196 
442881097566593344612847564823378678316527 1 2019091456485669234603486 10454326648213393607260249141273 
7245870066063155881748815209209628292540917153643678925903600113305305488204665213841469519415116094 
3305727036575959195309218611738193261179310511854807446237996274956735 188575272489 122793818301194912 
9833673362440656643086021394946395224737190702179860943702770539217176293176752384674818467669405132 
0005681271452635608277857713427577896091736371787214684409012249534301465495853710507922796892589235 
4201995611212902196086403441815981362977477130996051870721134999999837297804995105973173281609631859 
5024459455346908302642522308253344685035261931188171010003137838752886587533208381420617177669147303 
59825349042875546873115956286388235378759375195778185778053217122680661300192787661119590921642019893 


Figure 2: 1,001 Decimal digits of 7 with the ‘3’ included 


Archimedes: 223/71 < 1 < 22/7 


Figure 3: A pictorial proof of Archimedes’ inequalities 
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MEASUREMENT OF A CIRCLE. 


Proposition 1. 


The area of any circle is equal to a right-angled triangle in 
which one of the sides about the right angle is equal to the radius, 
and the other to the circumference, of the circle. 


Let ABCD be the given circle, X the triangle described. 


Figure 4: Construction showing uniqueness of 7, taken from Archimedes’ Measurement of a Circle 


A proof of this observation shows that e is not a quadratic irrational since such numbers have eventually periodic 
continued fractions. 


Archimedes’ famous computation discussed below is: 


10 10 
1 3 <a7<3E. 
@) i.) 
Figure 3 shows this estimate graphically, with the digits shaded modulo ten; one sees structure in 22/7, less 
obviously in 223/71, and presumptively not in 7. 


2 The Childhood of 7 


Four thousand years ago, the Babylonians used, among other values, the approximation 35 = 3.125. Then, 
or earlier, according to ancient papyri, Egyptians assumed a circle with diameter nine has the same area as a 
square of side eight, which implies 7 = 256/81 = 3.1604.... Some have argued that the ancient Hebrews were 
satisfied with 7 = 3: 


Also, he made a molten sea of ten cubits from brim to brim, round in compass, and ve cubits 
the height thereof; and a line of thirty cubits did compass it round about. (I Kings 7:23; see also 2 
Chronicles 4:2) 


One should know that the cubit was a personal not universal measurement. In Judaism’s further defense, 
several millennia later, the great Rabbi Moses ben Maimon Maimonedes (1135-1204), also known as the RaM- 
BaM, is translated by Langermann, in “The ‘true perplexity’ [9, p. 753] as fairly clearly asserting the irrational 
nature of 7: 
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“You ought to know that the ratio of the diameter of the circle to its circumference is unknown, nor 
will it ever be possible to express it precisely. This is not due to any shortcoming of knowledge on 
our part, as the ignorant think. Rather, this matter is unknown due to its nature, and its discovery 
will never be attained.” (Maimonedes) 


In each of these three cases the interest of the civilization in 7 was primarily in the practical needs of 
engineering, astronomy, water management and the like. With the Greeks, as with the Hindus, interest was 
centrally metaphysical and geometric. 


Archimedes and 7. Around 250 BCE, Archimedes of Syracuse (287-212 BCE) is thought to be the first to 
show that the “two possible Pi’s” are the same. Clearly for a circle of radius r and diameter d, Area= 7 r? 
while Perimeter = 72d, but that 71; = 7 is not obvious, and is often overlooked. Figure 4. reproduces his 
proof (construction) showing the coincidence of the two constants. 


Archimedes’ Method. The first rigorous mathematical calculation of 7 was also due to Archimedes, who 
used a brilliant scheme based on doubling inscribed and circumscribed polygons 


6H 12 24H 48H 96 


and computing the perimeters to obtain the bounds 320 <a< 34, that we have recapitulated above. The 
case of 6-gons and 12-gons is shown in Figure 5; for n = 48 one already ‘sees’ near-circles. Arguably no 
computational mathematics approached this level of rigour again until the 19th century. Phillips [9, pp. 15-19] 
calls Archimedes the ‘first numerical analyst’. 


Archimedes’ scheme constitutes the first true algorithm for 7, in that it is capable of producing an arbitrarily 
accurate value for 7. It also represents the birth of numerical and error analysis—all without positional notation 
or modern trigonometry. As discovered severally in the 19th century, this scheme can be stated as a simple, 
numerically stable, recursion, as follows [15]. 


Archimedean Mean Iteration (Pfaff-Borchardt-Schwab). Set ag = 2V3 and bo = 3—the values for 
circumscribed and inscribed 6-gons. Then define 


2anb 


(2) an+1 = f (H) bao = Van+ibn (G). 


An + bn 


This converges to 7, with the error decreasing by a factor of four with each iteration. In this case the error is 
easy to estimate, the limit somewhat less accessible but still reasonably easy [14, 15}. 


Variations of Archimedes’ geometrical scheme were the basis for all high-accuracy calculations of 7 for the 
next 1800 years—well beyond its ‘best before’ date. For example, in fifth century CE China, Tsu Chung-Chih 
used a variation of this method to get m correct to seven digits. A millennium later, al-Kashi in Samarkand 
“who could calculate as eagles can y” obtained 27 in sexagecimal: 

9 ne 16 oe yee Oe BE op Ph, OO 14 50 
vie ~ | - | - | { . 
60! * 60? * 603 " 604 * 60> * 60 " GO" * GOS * 609” 
good to 16 decimal places (using 3-2?°-gons). This is a personal favourite; reentering it in my computer centuries 
later and getting the predicted answer gave me horripilation (‘goose-bumps’). 


3 Pre-calculus Era z Calculations 


In Figures 6, 8, and 11 we chronicle the main computational records during the indicated period, only com- 
menting on signal entries. 
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Figure 5: Archimedes’ method of computing 7 with 6- and 12-gons 


Little progress was made in Europe during the ‘dark ages’, but a significant advance arose in India (450 CE): 
modern positional, zero-based decimal arithmetic—the “Indo-Arabic” system. This greatly enhanced arithmetic 
in general, and computing 7 in particular. The Indo-Arabic system arrived with the Moors in Europe around 
1000 CE. Resistance ranged from accountants who feared losing their livelihood to clerics who saw the system 
as ‘diabolical’—they incorrectly assumed its origin was Islamic. European commerce resisted into the 18th 
century, and even in scientific circles usage was limited until the 17th century. This is, of course a greatly 
simplified version of extraordinary events, for a recent article on the matter sce [6]. 


The prior difficulty of doing arithmetic is indicated by college placement advice given a wealthy German 
merchant in the 16th century: 


A wealthy (15th Century) German merchant, seeking to provide his son with a good business edu- 
cation, consulted a learned man as to which European institution o ered the best training. If you 
only want him to be able to cope with addition and subtraction, the expert replied, then any French 
or German university will do. But if you are intent on your son going on to multiplication and 
division assuming that he has su cient gifts then you will have to send him to Italy. ” (George 
Ifrah, [14]) 


Discussions about Roman arithmetic continue. Claude Shannon (1916-2001) had a mechanical calculator 
wryly called Throback 1 built to compute in Roman, at Bell Labs in 1953 to show that it was practicable, if a 
tad messy, to compute using Roman numerals! 


Ludolph van Ceulen (1540-1610). The last great Archimedean calculation, performed by van Ceulen using 
26?_gons—to 39 places with 35 correct—was published posthumously. The number is still called Ludolph’s 
number in parts of Europe and was inscribed on his head-stone. This head-stone disappeared centuries ago but 
was rebuilt, in part from surviving descriptions, recently as shown in Figure 7. It was reconsecrated on July 
5th 2000 with Dutch royalty in attendance. Ludolph van Ceulen, a very serious mathematician, was also the 
discoverer of the double angle formula for the cosine. 
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Name Year Digits 
Babylonians 2000? BCE 1 
Egyptians 2000? BCE 1 
Hebrews (1 Kings 7:23) 550? BCE 1 
Archimedes 250? BCE 3 
Ptolemy 150 3 
Liu Hui 263 5 
Tsu Ch’ung Chi 480? 7 
Al-Kashi 1429 14 
Romanus 1593 15 
van Ceulen (Ludolph’s number*) 1615 35 


Figure 6: Pre-calculus 7 Calculations 
4 Pi’s Adolescence 


Francois Viéte (1540-1603). The dawn of modern mathematics appears in Vigte s or Vigta s product (1579) 


2 V2V24V2V2tv2+v2 _ 
Ze 2 2 


Tv 


considered to be the first truly infinite product; and in the rst in nite continued fraction for 2/m given by Lord 
Brouncker (1620-1684), first President of the Royal Society of London: 


1 
9 
TT 
ne 56 
kere = 
Qaivss 


This was based on the following brilliantly ‘interpolated’ product of John Wallis? (1616-1703): 


which led to the discovery of the Gamma function (see End Note 2) and a great deal more. Variations on these 
formulas of Viete and Wallis continue to be published. 


A flavour of Viéte’s writings can be gleaned in this quotation from his work, first given in English in [9, p. 
759]. What we now take for granted was reason for much passionate argument. 


“Arithmetic is absolutely as much science as geometry [is]. Rational magnitudes are conveniently 
designated by rational numbers, and irrational [magnitudes] by irrational [numbers]. If someone 
measures magnitudes with numbers and by his calculation get them di erent from what they really 
are, it is not the reckoning s fault but the reckoner s. 


Rather, says Proclus, ARITHMETIC IS MORE EXACT THAN GEOMETRY.? To an accurate calculator, 
if the diameter is set to one unit, the circumference of the inscribed dodecagon will be the side of the 
binomial [i.e. square root of the di erence] 72 — /3888. Whosoever declares any other result, will be 
mistaken, either the geometer in his measurements or the calculator in his numbers.” (Viéte) 


?One of the few mathematicians whom Newton admitted respecting, and also a calculating prodigy! 
3The capitalized phrase was written in Greek. 
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Figure 7: Ludolph’s rebuilt tombstone in Leiden 


This fluent rendition is due to Marinus Taisbak, and the full text is worth reading. It certainly underlines how 
influential an algebraist and geometer Viéte was—as an early proponent of methods we now take for granted. 
Viéte, who was the first to introduce literals (‘x’ and ‘y’) into algebra, nonetheless rejected the use of negative 
numbers. 


Leonard Euler (1707-1783). Not surprisingly the great Euler ‘master of us all’ [20] made many contributions 
to the literature on 7. Equation (3) may be derived from Euler’s product formula for 7, given below in 
(4), with « = 1/2, or by repeatedly integrating rP sin?"(t) dt by parts. One may divine (4) as Euler did 
by considering sin(7x) as an in nite polynomial and obtaining a product in terms of the roots—which are 
0, {1/n? : n=+1,+2,---}. It is thus plausible that 


(4) ee eI (1-3). 


Euler in 1735, full well knowing that the whole argument was heuristic, argued that, as with a polynomial, 
c was the value at zero, 1, and the coefficient of x? in the Taylor series must be the sum of the roots. Hence, he 
was able to pick off coefficients to evaluate the zeta-function at two: 


1 We 
¢(m) := be i and marvellously ¢(2) = < 
n>1 
The explicit formula for ¢(2) solved the so called Basel problem posed in 1644. This method also leads to the 
evaluation of ¢(2n) as a rational multiple of 1?”: 


ne ad 7 78 
2) = 4) = = a 
(= 5,6) = 5, C6) = 5, 8) = gee 
Indeed, it produces: 
‘ie 


G(2m) = (-1)" FE Bom 


in terms of the Bernoulli numbers, B,, where t/(exp(t) — 1) = 30,59 Bnt”/n!, gives a generating function for 
the B,, which are perforce rational; see also [28]. = 
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Much less is known about odd integer values of ¢, though they are almost certainly not rational multiples of 
powers of 7. More than two centuries later, in 1976 Roger Apéry, [9, p. 439], [15], showed ¢(3) to be irrational, 
and we now also can prove that at least one of ¢(5),¢(7),¢(9) or ¢(11) is irrational, but we cannot guarantee 
which one. All positive integer values of ¢ are strongly believed to be irrational. Though it is not relevant to 
our story, Euler’s work on the zeta-function also lead ultimately to the celebrated Riemann hypothesis [14]. 


5 Pi’s Adult Life with Calculus 


In the later 17th century, Newton and Leibniz founded the calculus, and this powerful tool was quickly exploited 
to find new formulae for 7. One early calculus-based formula comes from the integral: 


“dt - : 
tanta = = (1—#4+t+-t94+---)dt 
o 1+? 0 


ine ge nt me) 
ee og 


Substituting « = 1 formally proves the well-known Gregory-Leibniz formula (1671-74) 


1 oi ve Dd 
35 7 9 Ii 


(5) a = t-24 
James Gregory (1638-75) was the greatest of a large Scottish mathematical family. The point, = 1, however, 
is on the boundary of the interval of convergence of the series. Justifying substitution requires a careful error 
estimate for the remainder or Lebesgue’s monotone convergence theorem, but most introductory texts ignore 
the issue, as they do the issue of Figure 4. The arctan integral and series was known centuries earlier to the 
Kerala school: identified with Madhava (c. 1350 —c. 1425) of Sangamagrama near Kerala, India, who may well 
have computed 13 digits of 7 by methods similar to those described in the next section. 


A Curious Anomaly in the Gregory Series. In 1988, it was observed that Gregory’s series for 7, 
& ( 1)k+1 


= tt to A 
= 4 AW Ti es eZ bed ae 
(6) 7 D1 ( a ae ok oe ) 


when truncated to 5,000,000 terms, differs strangely from the true value of 7: 


3.14159245358979323846464338327950278419716939938730582097494182 


230781640... 
3.141592653589793238462643383279502884197 16939937510582097494459 
230781640... 

2 =2 10 -122 2770 


Values differ as expected from truncating an alternating series, in the seventh place—a “4” which should be a 
“6.” But the next 13 digits are correct, and after another blip, for 12 digits. Of the first 46 digits, only four differ 
from the corresponding digits of 7. Further, the “error” digits seemingly occur with a period of 14, as shown 
above. Such anomalous behavior begs for explanation. A great place to start is by using Neil Sloane’s Internet- 
based integer sequence recognition tool, available at www.research.att.com/“njas/sequences. This tool has 
no difficulty recognizing the sequence of errors as twice Huler numbers. Even Euler numbers are generated by 
secx = )p29(—1)* Ba,27*/(2k)!. The first few are 1,—1,5,—61, 1385, —50521, 2702765. This discovery led to 
the following asymptotic expansion: 


i (—1)*+1 = Eom 
2 —— ® ; 
m ge ee 


454 


24. THE LIFE OF 7 (2014) 


Name Year | Correct Digits 
Sharp (and Halley) 1699 71 
Machin 1706 100 
Strassnitzky and Dase 1844 200 
Rutherford 1853 440 
Shanks 1874 (707) 527 
Ferguson (Calculator) 1947 808 
Reitwiesner et al. (ENIAC) | 1949 2,037 
Genuys 1958 10,000 
Shanks and Wrench 1961 100,265 
Guilloud and Bouyer 1973 1,001,250 


Figure 8: Calculus 7 Calculations 


Now the genesis of the anomaly is clear: by chance the series had been truncated at 5,000,000 terms—exactly 
one-half of a fairly large power of ten. Indeed, setting N = 10,000,000 in Equation (7) shows that the first 
hundred or so digits of the truncated series value are small perturbations of the correct decimal expansion for 
a. And the asymptotic expansions show up on the computer screen, as we observed above. 


On a hexadecimal computer with N = 16’ the corresponding strings and hex-errors are: 


3. 243F6A8885A308D313198A2E03707344A4093822299F31D0082EFA98EC4E6C8 
9452821E... 
3. 243F6A6885A308D31319AA2E03707344A3693822299F31D7A82EF A98EC4DBF6 
9452821E... 

2 =2 A 


-7A 2AD2 


with the first being the correct value of 7. (In hexadecimal or hex one uses ‘A,B, ..., F’ to write 10 through 15 
as single ‘hex-digits’.) Similar phenomena occur for other constants. (See [9].) Also, knowing the errors means 
we can correct them and use (7) to make Gregory’s formula computationally tractable, notwithstanding the 
following discussion of complexity! 


6 Calculus Era 7 Calculations 


Used naively, the beautiful formula (5) is computationally useless—so slow that hundreds of terms are needed 
to compute two digits. Sharp, under the direction of Halley*, (see Figure 8) actually used tan—!(1/V3) which 
is geometrically convergent. Moreover, Euler’s (1738) trigonometric identity 


1 1 
tan! (1) = tan~! (5) +tan7? (5) 
2 3 


produces the geometrically convergent rational series 


(8) 


(9) T 1 1,4 1 be vl 1 i 1 1 
3.93 ' 5. 7.Q7 3. 3.338 ' 5.35 


An even faster formula, found earlier by John Machin, lies similarly in the identity 


© = Atan-} : —tan7+ = 
4 5 ; 939 } 


4The astronomer and mathematician who largely built the Greenwich Observatory and after whom the comet is named. 


(10) 


10 
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0.6 07 0.8 09 1 


Figure 9: Newton’s method of computing Pi: “J am ashamed to tell you to how many gures I carried these 
computations, having no other business at the time.” Issac Newton, 1666. 


This was used in numerous computations of 7, given in Figure 8, starting in 1706 and culminating with 
Shanks’ famous computation of 7 to 707 decimal digits accuracy in 1873 (although it was found in 1945 to 
be wrong after the 527-th decimal place, by Ferguson, during the last adding machine-assisted pre-computer 
computations”). 


Newton’s arcsin computation. Newton discovered a different more effective—actually a disguised arcsin— 
formula. He considering the area A of the left-most region shown in Figure 9. Now, A is the integral 


1/4 


(11) A = Va-— x? dz. 
0 


Also, A is the area of the circular sector, 7/24, less the area of the triangle, 3/32. Newton used his newly 
developed binomial theorem in (11): 


A 


4 1/2 1/2 t 1/2 xc 2 2 5a4 
1—2)/? de = 1 a 
[ eee ae [ 1 ( 2. 8 16 128 ) 7 


z 3/2 5/2 7/2 9/2 
| 4 g2_® x x onl . 
0 2 8 16 128 


Integrate term-by-term and combining the above produces 


3V3 4, (1 1 1 1 
4 3-8 5-32 7-128 9-512 


Newton used this formula to compute 15 digits of 7. As noted, he later ‘apologized’ for “having no other 
business at the time.” (1665-1666 was the year of the great plague which closed Cambridge, and of the great 
fire of London of September 1666.). It was also directly after Newton’s Annus mirabilis that led ultimately to 
his Principia.) A standard chronology ([26] and [9, p. 294]) says Newton signi cantly never gave a value for 
mw.” Caveat emptor, all users of secondary sources. 


The Viennese computer. Until quite recently—around 1950—a computer was a person. As a teenager, this 
computer, one Johan Zacharias Dase (1824-1861), would demonstrate his extraordinary computational skill by, 
for example, multiplying 

79532853 x 93758479 = 7456879327810587 


5This must be some sort a record for the length of time needed to detect a mathematical error. 
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in 54 seconds; two 20-digit numbers in six minutes; two 40-digit numbers in 40 minutes; two 100-digit numbers 
in 8 hours and 45 minutes. In 1844, after being shown 


La rs pe trey ee ee 
oe 2 an 5 an 8 


he calculated 7 to 200 places in his head in two months, completing correctly—to my mind—the greatest 
mental computation ever. Dase later calculated a seven-digit logarithm table, and extended a table of integer 
factorizations to 10,000,000. On Gauss’s recommendation Dase was hired to assist this project, but Dase died 
not long afterwards in 1861 by which time Gauss himself already was dead. 


An amusing Machin-type identity—meaning an equation that expresses 7 as a linear combination of 
arctan’s—due to the Oxford logician Charles Dodgson is 


tan7 (5) = a0 ( : ) tan ( : ). 
Pp p+q p+rr 


valid whenever 1 + p? factors as gr. Dodgson is much better known as Lewis Carroll, the author of Alice in 
Wonderland. 


7 The Irrationality and Transcendence of 7 


One motivation for computations of 7 was very much in the spirit of modern experimental mathematics: to 
see if the decimal expansion of 7 repeats, which would mean that 7 is the ratio of two integers (i.e., rational), 
or to recognize 7 as algebraic—the root of a polynomial with integer coefficients—and later to look at digit 
distribution. The question of the rationality of 7 was settled in the late 1700s, when Lambert and Legendre 
proved (using continued fractions) that the constant is irrational. 


The question of whether 7 was algebraic was settled in 1882, when Lindemann proved that 7 is transcenden- 
tal. Lindemann’s proof also settled, once and for all, the ancient Greek question of whether the circle could be 
squared with straight-edge and compass. It cannot be, because numbers that are the lengths of lines that can be 
constructed using ruler and compasses (often called constructible numbers) are necessarily algebraic, and squar- 
ing the circle is equivalent to constructing the value 7. The classical Athenian playwright Aristophanes already 
‘knew’ this and perhaps derided ‘circle-squarers’ (reTpayywovetv) in his play The Birds of 414 BCE. Likewise, 
the French Academy had stopped accepting proofs of the three great constructions of antiquity—squaring the 
circle, doubling the cube and trisecting the angle—centuries before it was proven impossible. 


We next give, in extenso, Ivan Niven’s 1947 short proof of the irrationality of 7. It well illustrates the 
ingredients of more difficult later proofs of irrationality of other constants, and indeed of Lindemann’s proof of 
the transcendence of 7 building on Hermite’s 1873 proof of the transcendence of e. 


8 A Proof that z is Irrational 


Proof. Let 7 = a/b, the quotient of positive integers. We de ne the polynomials 


x” (a — bx)” 
n! 


f(a) = 
F(x) = f(a) — f(a) + f(a) — + + (-"f2(2) 


the positive integer being speci ed later. Since n\f(a) has integral coe cients and terms in x of degree not 
less than n, f(x) and its derivatives f9)(a) have integral values for « = 0; also for « = 7 = a/b, since 
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f(x) = f(a/b— 2x). By elementary calculus we have 
d 
air @) sing — F(x) cosz} = F(x) sinx + F(a)sinz = f(x) sine 
x 


and 


(a) sin x dx [F’(x) sina — F(a) cos a]p 


T 
0 


(12) = F(r)+F(0). 


Now F(r) + F(0) is an integer, since f9)(0) and f (1) are integers. But for0 <a <7, 


wa 
n! ? 


n 


0< f(z)sina < 


so that the integral in (12) is positive but arbitrarily small for n su ciently large. Thus (12) is false, and so is 
our assumption that 7 is rational. QED 


Irrationality measures. We end this section by touching on the matter of measures of irrationality. The 
infimum j(@) of those 4 > 0 for which 


for all integers p,q with sufficiently large q, is called the Liouville-Roth constant for a and we say that we have 
an irrationality measure for a if f(a) < oo. 


Irrationality measures are difficult. Roth’s theorem, [15], implies that p(a) = 2 for all algebraic irrationals, 
as is the case for almost all reals. Clearly, (a) = 1 for rational a and j(@) = oo if and only if a is a so-called 
Liouville number such as 3>1/10"'. It is known that p(e) = 2 while in 1993 Hata showed that p(7) < 8.02. 
Similarly, it is known that (¢(2)) < 5.45, (¢(3)) < 4.8 and pu(log 2) < 3.9. 


A consequence of the existence of an irrationality measure y for 7, is the ability to estimate quantities such 
as limsup|sin(n)|!/" = 1 for integer n, since for large integer m and n with m/n — 7, we have eventually 


+ 
n| > . 
2me-1 


1 
| sin(n)| = | sin(mz) — sin(n)| > 5 |ma 


Related matters are discussed at more length in [2]. 


9 7 in the Digital Age 


With the substantial development of computer technology in the 1950s, 7 was computed to thousands and 
then millions of digits. These computations were greatly facilitated by the discovery soon after of advanced 
algorithms for the underlying high-precision arithmetic operations. For example, in 1965 it was found that the 
newly-discovered fast Fourier transform (FFT) [15, 14] could be used to perform high-precision multiplications 
much more rapidly than conventional schemes. Such methods (e.g., for +, /% see [15, 16, 14]) dramatically 
lowered the time required for computing 7 and other constants to high precision. We are now able to compute 
algebraic values of algebraic functions essentially as fast as we can multiply, Og(M(N)), where M(N) is the 
cost of multiplication and Og counts ‘bits’ or ‘flops’. To convert this into practice: a state-of-the-art processor 
in 2010, such as the latest AMD Opteron, which runs at 2.4 GHz and has four floating-point cores, each of 
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which can do two 64-bit floating-point operations per second, can produce a total of 9.6 billion floating-point 
operations per second. 


In spite of these advances, into the 1970s all computer evaluations of 7 still employed classical formulae, 
usually of Machin-type, see Figure 8. We will see below methods that can compute N digits of 7 with time 
complexity Og(M(N)) log Og(M(N)). Proving that the log term is unavoidable, as seems likely, would yield 
an algorithmic proof—quite different from current proofs—that 7 is not algebraic. Such a proof would be a 
significant contribution to number theory and complexity theory. 


Electronic Numerical Integrator and Calculator. The first computer calculation of 7 was performed 
on ENIAC—a behemoth with a tiny brain from today’s vantage point. The ENIAC was built in Aberdeen, 
Maryland by the US Army: 


Size/weight. ENIAC had 18,000 vacuum tubes, 6,000 switches, 10,000 capacitors, 70,000 resistors, 
1,500 relays, was 10 feet tall, occupied 1,800 square feet and weighed 30 tons. 

Speed/memory. A, now aged, 1.5GHz Pentium does 3 million adds/sec. ENIAC did 5,000, three 
orders faster than earlier machines. The first stored-memory computer, ENIAC stored 200 digits. 
Input /output. Data flowed from one accumulator to the next, and after each accumulator finished 
a calculation, it communicated its results to the next in line. The accumulators were connected to 
each other manually. The 1949 computation of 7 to 2,037 places on ENIAC took 70 hours in which 
output had to be constantly reintroduced as input. 


A fascinating description of the ENIAC’s technological and commercial travails is to be found in [25]. Note that 
ENIAC as a child of the 1940’s was called a ‘calculator’ not a computer. 


Ballantine’s (1939) Series for 7. Another formula of Euler for arccot is 


ka 1)? 4” 1 
x > ie ) ye = arctan (=) : 


2— (2n4+1)! (2? +1 


This, intriguingly and usefully, allowed Guilloud and Boyer to reexpress the formula, used by them in 1973 to 
compute a million digits of 7, viz, 7/4 = 12 arctan (1/18) + 8 arctan (1/57) — 5 arctan (1/239) in the efficient 
form 


(nl)? 4” 1 
= 864 + 1824 20 arctan (—), 
. 3 orn vse » Qn+i!s207F NS (sa) 


n= a ( 


where the terms of the second series are now just ei shifts of the first. 


Ramanujan-type elliptic series. Truly new types of infinite series formulae, based on elliptic integral 
approximations, were discovered around 1910 by Srinivasa Ramanujan (1887-1920) (shown in Figure 10) but 
were not well known (nor fully proven) until quite recently when his writings were fully decoded and widely 
published. They are based on elliptic functions and are described at length in [9, 15, 14]. 


G.N. Watson (see [14]) elegantly describes his feelings on viewing formulae of Ramanujan, such as (13) below: 


. athrill which is indistinguishable from the thrill which I feel when I enter the Sagrestia 
Nuova of the Cappella Medici and see before me the austere beauty of the four statues 
representing Day, Night, Evening, and Dawn which Michelangelo has set over the 
tomb of Giuliano de Medici and Lorenzo de Medici 


One of these series is the remarkable: 


foe) 


ola (1103 + 26390k) 
01 (k!)43964" 


(13) = 
k= 
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BOOK - POST 


BY AIR M 


PAR AVION 


AN F STS & TELEGRAPHS 


Figure 10: Ramanujan’s seventy-fifth birthday stamp (courtesy Naomi Borwein-Yao 


Each term of this series produces an additional eight correct digits in the result. When Gosper used this formula 
o compute 17 million digits of 7 in 1985, and it agreed to many millions of places with the prior estimates, 
this concluded the rst proof of (13). As described in [17], this computation can be shown to be exact enough 
0 constitute a bona fide proof ! Actually, Gosper first computed the simple continued fraction for 7, hoping 
o discover some new things in its expansion, but found none. At the time of writing 500 million terms of the 
continued fraction for 7 have been computed by Neil Bickford (a teenager) without shedding light on whether 
he sequence is unbounded (see [1]). 


At about the same time, David and Gregory Chudnovsky found the following rational variation of Ramanu- 
jan’s formula. Amazingly one can show that it exists because /—163 corresponds to an imaginary quadratic 
field with class number one: 


14 
coe (3k)! (k!)3 64032034+3/2 


Each term of this series produces an additional 14 correct digits. The Chudnovskys, shown in Figure 14, 
implemented this formula using a clever scheme that enabled them to use the results of an initial level of 
precision to extend the calculation to even higher precision. They used this in several large calculations of 7, 
culminating with a then record computation of over four billion decimal digits in 1994. Their remarkable 
story was told compellingly by Richard Preston in a prizewinning New Yorker article “The Mountains of Pi” 
(March 2, 1992). 


While the Ramanujan and Chudnovsky series are in practice considerably more efficient than classical for- 
mulae, they share the property that the number of terms needed increases linearly with the number of digits 
desired: if you want to compute twice as many digits of , you must evaluate twice as many terms of the series. 

Relatedly, the Ramanujan-type series 


1 Sf (7) * 42 +5 
”) rT > (2) 16 


n=0 


allows one to compute the billionth binary digit of 1/7, or the like, without computing the rst half of the series, 
and is a foretaste of our later discussion of Borwein-Bailey-Plouffe (or BBP) formulae. 
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Name Year Correct Digits 
Miyoshi and Kanada 981 2,000,036 
Kanada-Yoshino-Tamura 982 16,777,206 
Gosper 985 17,526,200 
Bailey Jan. 1986 29,360,111 
Kanada and Tamura Sep. 1986 33,554,414 
Kanada and Tamura Oct. 1986 67,108,839 
Kanada et. al Jan. 1987 134,217,700 
Kanada and Tamura Jan. 1988 201,326,551 
Chudnovskys May 1989 480,000,000 
Kanada and Tamura Jul. 1989 536,870,898 
Kanada and Tamura Nov. 1989 1,073,741,799 
Chudnovskys Aug. 1991 2,260,000,000 
Chudnovskys May 1994 4,044,000,000 
Kanada and Takahashi Oct. 1995 6,442,450,938 
Kanada and Takahashi Jul. 1997 51,539,600,000 
Kanada and Takahashi Sep. 1999 206,158,430,000 
Kanada-Ushiro-Kuroda Dec. 2002 | 1,241,100,000,000 
Takahashi Jan. 2009 | 1,649,000,000,000 
Takahashi April. 2009 | 2,576,980,377,524 
Bellard Dec. 2009 | 2,699,999,990,000 


Figure 11: Post-calculus 7 Calculations 


10 Reduced Operational Complexity Algorithms 


In 1976, Eugene Salamin and Richard Brent independently discovered a reduced complexity algorithm for 7. It is 
based on the arithmetic-geometric mean iteration (AGM) and some other ideas due to Gauss and Legendre 
around 1800, although neither Gauss, nor many after him, ever directly saw the connection to effectively 
computing 7. 


Quadratic Algorithm (Salamin-Brent). Set ap = 1, bo = 1/2 and sp = 1/2. Calculate 


yes bp 
(16) a, = Moet (Arithmetic) be = Vae—10K-1 (Geometric) 
2 2 
(17) Ch = az — bj, Sk = 8r—-1 —2"c, and compute pp = 08 
Sk 


Then p; converges quadratically to x. Note the similarity between the arithmetic-geometric mean iteration (16), 
(which for general initial values converges quickly to a non-elementary limit) and the out-of-kilter harmonic- 
geometric mean iteration (2) (which in general converges slowly to an elementary limit), and which is an 
arithmetic-geometric iteration in the reciprocals (see [15]). 


Each iteration of the algorithm doubles the correct digits. Successive iterations produce 1, 4, 9, 20, 42, 85, 173, 347 
and 697 good decimal digits of 7, and takes log N operations for N digits. Twenty-five iterations computes 7 
to over 45 million decimal digit accuracy. A disadvantage is that each of these iterations must be performed to 
the precision of the final result. In 1985, my brother Peter and I discovered families of algorithms of this type. 
For example, here is a genuinely third-order iteration®: 


6A fourth-order iteration might be a compound of two second-order ones; this third order one can not be so decomposed. 
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Cubic Algorithm. Set aj = 1/3 and so = (V3 —1)/2. Iterate 


es ens oo 
1+ 20 — 81/8" Sk+1 5 


pet ke (2 
Troi = and Any = Tpy1ak — 8° ("744 — 1). 


Then 1/ax converges cubically to x. Each iteration triples the number of correct digits. 


Quartic Algorithm. Set a9 = 6 — 4/2 and Yo = /2 —1. Iterate 


1— (1— y¢)/4 


Vee. = 4, and Any = ag(1+ Yr)? a ae 
[+U=y)" 


ye+i(L + yeti + Yi41): 
Then 1/a, converges quartically to 7. Note that only the power of 2 or 3 used in ax depends on k. 


Let us take an interlude and discuss: 


‘Piems’ or pi-mnemonics. Piems are mnemonics in which the length of each word is the corresponding digit 
of 7. Punctuation is ignored. A better piem is both longer and better poetry. 


Mnemonics for 7 


Now I, even I, would celebrate 
In rhyme inapt, the great 
Immortal Syracusan, rivaled nevermore, 
Who in his wondrous lore, 
Passed on before 
Left men for guidance 
How to circles mensurate. (30) 


How I want a drink, alcoholic of course, after the heavy lectures involving 
quantum mechanics. (15) 


See I have a rhyme assisting my feeble brain its tasks ofttimes resisting. (13) 


There are many more and much longer mnemonics than the famous examples given in the inset box—see 
[9, p. 405, p.560, p. 659] for a fine selection. Indeed, there is whole cottage industry around the matter, 
http://www. huffingtonpost .com/jonathan-m-borwein/pi-day_b_1341569.html?ref=science. 


Philosophy of mathematics. In 1997 the first occurrence of the sequence 0123456789 was found (later than 
expected heuristically) in the decimal expansion of 7 starting at the 17,387,594, 880-th digit after the decimal 
point. In consequence the status of several famous intuitionistic examples due to Brouwer and Heyting has 
changed. These challenge the principle of the excluded middle—either a predicate holds or it does not— and 
involve classically well-defined objects that for an intuitionist are ill-founded until one can determine when or 
if the sequence occurred (see [12]). 


For example, consider the sequence which is ‘0’ except for a ‘1’ in the first place where 0123456789 first 
begins to appear in order if it ever occurs. Did it converge when first used by Brouwer as an example? Does 
it now? Was it then and is it now well defined? Classically it always was and converged to ‘0’. But, until the 
computation was done any argument about its convergence relied on the principle of the excluded middle which 
intuitionists reject. Intuitionistically, it converges now. Of course, if we redefine the sequence to have its ‘1’ in 
the first place that 0123456789101112 first begins, then we are still completely in the dark. But is a sign of how 
far computation of 7 has come recently that this was never thought necessary. 
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Figure 12: Yasumasa Kanada in his Tokyo office 
11 Back to the Future 


In December 2002, Kanada computed 7 to over 1.24 trillion decimal digits. His team first computed 7 in 
hexadecimal (base 16) to 1,030,700,000,000 places, using the following two arctangent relations: 


1 i 1 1 
= 48 tan7! — + 128 tan7! — — 20 tan7! — + 48 tan7! ——— 
. se ae og Tamas 


1 1 1 1 
= 176tan~! = + 28 tan~+ —— — 48 tan~' —- + 96tan7! ——_.. 
: an 57 7 P8539 — SP tAR Gag FMOTAR T5943 
The first formula was found in 1982 by K. Takano, a high school teacher and song writer. The second formula 
was found by F. C. W. Stérmer in 1896. Kanada verified the results of these two computations agreed, and 
then converted the hex digit sequence to decimal. The resulting decimal expansion was checked by converting 
it back to hex. These conversions are themselves non-trivial, requiring massive computation. 


This process is quite different from those of the previous quarter century. One reason is that reduced 
operational complexity algorithms require full-scale multiply, divide and square root operations. These in turn 
require large-scale fast Fourier transform (FFT) operations, which demand huge amounts of memory, and 
massive all-to-all communication between nodes of a large parallel system. For this latest computation, even 
the very large system available in Tokyo did not have sufficient memory and network bandwidth to perform 
these operations at reasonable efficiency levels—at least not for trillion-digit computations. Utilizing arctans 
again meant using many more arithmetic operations, but no system-scale FFTs, and so the method can be 
implemented using x,+ by smallish integer values—additionally, hex is somewhat more efficient! 


Kanada and his team evaluated these two formulae using a scheme analogous to that employed by Gosper 
and by the Chudnovskys in their series computations, in that they were able to avoid explicitly storing the 
multiprecision numbers involved. This resulted in a scheme that is roughly competitive in numerical efficiency 
with the Salamin-Brent and Borwein quartic algorithms they had previously used, but with a significantly lower 
total memory requirement. Kanada used a 1 Tbyte main memory system, as with the previous computation, 
yet got six times as many digits. Hex and decimal evaluations included, it ran 600 hours on a 64-node Hitachi, 
with the main segment of the program running at a sustained rate of nearly 1 Tflop/sec. 


18 


24. THE LIFE OF 7a (2014) 463 


Hex Digit Occurrences 

0 62499881108 

Decimal Digit Occurrences 1 62500212206 
2 62499924780 

0 99999485134 3 62500188844 

i 99999945664 d 62499807368 

2 100000480057 5 62500007205 

3 99999787805 6 62499925426 

4 100000357857 7 62499878794 

5 99999671008 8 62500216752 

6 99999807503 9 62500120671 

7 99999818723 A 62500266095 

8 100000791469 B 62499955595 

9 99999854780 Cc 62500188610 

D 62499613666 

Total 1000000000000 E 62499875079 
F 62499937801 


Total 1000000000000 


Figure 13: Seemingly random behaviour of single digits of 7 in base 10 and 16 
12 Why 7? 


What possible motivation lies behind modern computations of 7, given that questions such as the irrationality 
and transcendence of 7 were settled more than 100 years ago? One motivation is the raw challenge of harnessing 
the stupendous power of modern computer systems. Programming such calculations are definitely not trivial, 
especially on large, distributed memory computer systems. 


There have been substantial practical spin-offs. For example, some new techniques for performing the fast 
Fourier transform (FFT), heavily used in modern science and engineering computing, had their roots in attempts 
to accelerate computations of 7. And always the computations help in road-testing computers—often uncovering 
subtle hardware and software errors. 


Beyond practical considerations lies the abiding interest in the fundamental question of the normality (digit 
randomness) of 7. Kanada, for example, has performed detailed statistical analysis of his results to see if there 
are any statistical abnormalities that suggest 7 is not normal, so far the answer is “no” (see Figures 1 and 
13). (Kanada reports that the 10 decimal digits ending in position one trillion are 6680122702, while the 10 
hexadecimal digits ending in position one trillion are 3F89341CD5.) Indeed, the first computer computation of 7 
and e on ENIAC, discussed above, was so motivated by John von Neumann. The digits of 7 have been studied 
more than any other single constant, in part because of the widespread fascination with and recognition of 7. 
Very recent work, suggesting 7 may well be normal, can be found in and traced from [1]. 


Changing world views. In retrospect, we may wonder why in antiquity 7 was not measured to an accuracy 
in excess of 22/7? Perhaps it reflects not an inability to do so but a very different mindset to a modern more 
experimental one. One can certainly find Roman ampitheatres where more accurate measurement than 22/7 
would have been helpful. Bailey and I discuss this issue in more detail in [6]. 


In the same vein, one reason that Gauss and Ramanujan did not further develop the ideas in their identities 
for 7 is that an iterative algorithm, as opposed to explicit results, was not as satisfactory for them (especially 
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Figure 14: The remarkable Chudnovsky brothers 7 (courtesy D. and G. Chudnovsky) 


Ramanujan). Ramanujan much preferred formulae like 


3 3 
7 & —— log (5280) , 
V67 B( ) V163 


correct to 9 and 15 decimal places; both of which rely on deep number theory. Contrastingly, Ramanujan in his 
famous 1914 paper Modular Equations and Approximations to Pi (9, p.253] found 


log (640320) = a 


192\ 1/4 5 
(v . =) = 3.14159265258 --- 


“empirically, and it has no connection with the preceding theory.” Only the marked digit is wrong. 


Discovering the 7 Iterations. The genesis of the 7 algorithms and related material is an illustrative example 
of experimental mathematics. My brother and I in the early 1980’s had a family of quadratic algorithms for 
m, [15], call them Py, of the kind we saw above. For N = 1,2,3,4 we could prove they were correct but 
were only conjectured for N = 5,7. In each case the algorithm appeared to converge quadratically to 7. On 
closer inspection while the provable cases were correct to 5,000 digits, the empirical versions agreed with 7 to 
roughly 100 places only. Now in many ways to have discovered a “natural” number that agreed with 7 to that 
level—and no more—would have been more interesting than the alternative. That seemed unlikely but recoding 
and rerunning the iterations kept producing identical results. 


Two decades ago even moderately high precision calculation was less accessible, and the code was being run 
remotely over a phone-line in a Berkeley Unix integer package. After about six weeks, it transpired that the 
package’s square root algorithm was badly awed, but only if run with an odd precision of more than sixty digits! 
And for idiosyncratic reasons that had only been the case in the two unproven cases. Needless to say, tracing 
the bug was a salutary and somewhat chastening experience. And it highlights why one checks computations 
using different sub-routines and methods. 


13. How to Compute the N-th Digits of 7 


One might be forgiven for thinking that essentially everything of interest with regards to 7 has been dealt with. 
This is suggested in the closing chapters of Beckmann’s 1971 book A History of 7. Ironically, the Salamin— 
Brent quadratically convergent iteration was discovered only five years later, and the higher-order convergent 
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algorithms followed in the 1980s. Then in 1990, Rabinowitz and Wagon discovered a “spigot” algorithm for 
a—the digits ‘drip out’ one by one. This permits successive digits of 7 (in any desired base) to be computed 
by a relatively simple recursive algorithm based on all previously generated digits. 


Even insiders are sometimes surprised by a new discovery. Prior to 1996, most researchers thought if you 
want to determine the d-th digit of 7, you had to generate the (order of) the entire first d digits. This is not 
true, at least for hex (base 16) or binary (base 2) digits of 7. In 1996, Peter Borwein, Plouffe, and Bailey found 
an algorithm for computing individual hex digits of 7. It (1) yields a modest-length hex or binary digit string 
for 7, from an arbitrary position, using no prior bits; (2) is implementable on any modern computer; (3) requires 
no multiple precision software; (4) requires very little memory; and (5) has a computational cost growing only 
slightly faster than the digit position. For example, the millionth hexadecimal digit (four millionth binary digit) 
of could be found in four seconds on a 2005 Apple computer. 


This new algorithm is not fundamentally faster than the best known schemes if used for computing all digits 
of 7 up to some position, but its storage requirements, elegance and simplicity are of considerable interest, and 
it is easy to parallelize. It is based on the following at-the-time new formula for 7: 


1 4 2 1 1 
1 = 5 
(18) a > iG (S 8i+4 Bi+5 a3) 


which was discovered using integer relation methods (see [14]), with a computer search that ran over several 
months and ultimately produced the (equivalent) relation: 


: + 2arct : log5 
7 arctan | 5 0g 5, 


where the first term is a generalized Gaussian hypergeometric function evaluation. Maple and Mathematica can 
both now prove (18). They could not at the time of its discovery. A human proof may be found in [14]. 


j,i 
(19) nm = 4-oF,[ 34 


The algorithm in action. In 1997, Fabrice Bellard at INRIA—whom we shall meet again in Section 15— 
computed 152 binary digits of a starting at the trillionth position. The computation took 12 days on 20 
workstations working in parallel over the Internet. Bellard’s scheme is based on the following variant of (18): 


SS (-1)F 14 (-1)* (32 8 1 
—" 4 i 
- > FR +I) 6a 2+ 1024 dk+1' 4k+2* dk+3)’ 


which permits hex or binary digits of 7 to be calculated somewhat faster than (18) depending on the imple- 
mentation. (Most claims of improved speed of algorithms are subject to many caveats.) 


In 1998 Colin Percival, then a 17-year-old student at Simon Fraser University (see Figure 15), accessed 25 
machines to calculate first the five trillionth hexadecimal digit of 7, and then the ten trillionth hex digit. In 
September 2000, he found the quadrillionth binary digit is 0, a computation that required 250 CPU-years, using 
1784 machines in 56 countries. We record some of Percival’s computational results in Figure 15. 


Nor have matters stopped there. As described in [5, 8] in the most recent computation of 7 using the 
BBP formula, Tse-Wo Zse of Yahoo! Cloud Computing calculated 256 binary digits of 7 starting at the two 
quadrillionth bit. He then checked his result using Bellard’s variant. In this case, both computations verified 
that the 24 hex digits beginning immediately after the 500 trillionth hex digit (i.e., after the two quadrillionth 
binary bit) are: E6C1294A ED40403F 56D2D764. 


Kanada was able to confirm his 2002 computation in only 21 hours by computing a 20 hex digit string 
starting at the trillionth digit, and comparing this string to the hex string he had initially obtained in over 600 
hours. Their agreement provided enormously strong confirmation. We shall see this use of BBP for verification 
again when we discuss the most recent record computations of 7. 
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Colin Percival (1998) 


Hex strings starting 
Position at this Position 
10° 26C65E52CB4593 
107 17AF5863EFED8D 
108 ECB840E21926EC 
109 85895585A0428B 
101° 921C73C6838FB2 
1ol 9C381872D27596 
1.25 x 10!” 07E45733CC790B 
2.5 x 10!4 E6216BO69CB6C1 


Figure 15: Percival’s hexadecimal findings 
14 Further BBP Digit Formulae 


Motivated as above, constants a of the form 


= k 
(20) ae > aa 


k=0 


where p(k) and q(k) are integer polynomials, are said to be in the class of binary (Borwein-Bailey-Plou e) BBP 
numbers. I illustrate for log 2 why this permits one to calculate isolated digits in the binary expansion: 


ca | 
21 log2 = ay 
(21) og d a 


We wish to compute a few binary digits beginning at position d+ 1. This is equivalent to calculating 
{27 log 2}, where {-} denotes fractional part. We can write 


d od-k 2 od-k 
(22) {24log 2} = Hoe ty > : t} 
k=0 k=d+1 
d od-k mo °° d—k 
. {e125} 
k=0 k=d+1 


The key observation is that the numerator of the first sum in (23), 24-* mod k, can be calculated rapidly by 
binary exponentiation, performed modulo k. That is, it is economically performed by a factorization based on 
the binary expansion of the exponent. For example, 


37 = ((8)2)?)2)-3 


uses only five multiplications, not the usual 16. It is important to reduce each product modulo k. Thus, 3!7 
mod 10 is done as 


3? = 9:9? =1;17 =1;17 =1;1x3=3. 


A natural question in light of (18) is whether there is a formula of this type and an associated computational 
strategy to compute individual decimal digits of 7. Searches conducted by numerous researchers have been 
unfruitful and recently D. Borwein (my father), Gallway and I have shown that there are no BBP formulae of 
the Machin-type (as defined in [14]) of (18) for 7 unless the base is a power of two [14]. 
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Figure 16: Ferguson’s “Eight-Fold Way” and his BBP acrylic circles. These three ‘subtractive’ acrylic circles 
(white) and the black circle represent the weights [4,—-2, —2,—1] in Equation (18) 


Ternary BBP formulae. Yet, BBP formulae exist in other bases for some constants. For example, for 1? 
we have both binary and ternary formulae (discovered by Broadhurst): 


f= 3 1 16 24 8 6 1 
8H 64" (6K +1)? (6k4+2)? (6h +3)? (R44)? * (6h +5)? 
(24) 
@uZ 3 1 243 405 81 27 
27 729" \(12k+ 1)? (12k +2)? (12k +4)? (12k +5)? 
(25) 72 9 9 5 1 
(12k+6)?  (12k+7)? (12k +8)? (12k +10)? " (12k +11)? )° 


These two formulae have recently been used for record digit computations performed on an IBM Blue Gene 
system in conjunction with IBM Australia [8]. 


Remarkably the volume Vg in hyperbolic space of the gure-eight knot complement is well known to be 


oo 2n-1 
1 1 
Vg = 2V3 y n(@) y e 2.0298832128193072500424051085409 .. . 
n=1 n/ k=n 


Surprisingly, it is also expressible as 


_ V3 (-1)" 18 18 24 6 2 
e="q 2 Q7n ere (6n+2)2 (6n+3)2 (6n+4)2— cEnat 


n=0 


again discovered numerically by Broadhurst, and proved in [14]. A beautiful representation by Helaman Ferguson 
the mathematical sculptor is given in Figure 16. Ferguson produces art inspired by deep mathematics, but not 
by a formulaic approach. For instance, his knowledge of hyperbolic geometry allows him to exploit surfaces of 
negative curvature as shown in his“Kight-Fold Way”. 


Normality and dynamics. Finally, Bailey and Crandall in 2001 made exciting connections between the 
existence of a b-ary BBP formula for a and its normality base b (uniform distribution of base-b digits)". They 


7See www.sciencenews.org/20010901/bob9.asp. 
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make a reasonable, hence very hard, conjecture about the uniform distribution of a related chaotic dynamical 
system. This conjecture implies: Existence of a BBP formula base b for a ensures the normality base b of a. 
Illustratively, or log 2°, the dynamical system, base 2, is to set 29 = 0 and compute 


1 
In41 2 (« + *) mod 1. 


15 Piin the Third Millennium 


15.1 Reciprocal series 


A few years ago Jestis Guillera found various Ramanujan-like identities for 7, using integer relation methods. 
The three most basic are: 


4 co 1 2n+1 
(26) =z = (—1)"r(n)° (13 + 180n + 820n?) ( — 

7 dX (35) 

foe} 2n4+1 
(27) = = So-1)"2(n)(1 + 82+ 20n2) (5) 
n=0 

4 4 co : 2n4+1 
(28) sS = r(n)"(1 + 14n + 76n? + 1683) | = ; 

sae (5) 
where r(n) := (1/2-3/2+ +--+ + (2n —1)/(2n))/n!. 


Guillera proved (26) and (27) in tandem, using most ingeniously the Wilf Zeilberger algorithm for formally 
proving hypergeometric-like identities [14, 7, 29]. No other proof is known and there seem to be no like formulae 
for 1/74 with d > 4. The third (28) is certainly true,® but has no proof; nor does anyone have an inkling of 
how to prove it; especially as experiment suggests that it has no ‘mate’ unlike (26) and (27) [7]. My intuition is 
that if a proof exists it is more a verification than an explication and so I stopped looking. I am happy just to 
know the beautiful identity is true. A very nice account of the current state of knowledge for Ramanujan-type 
series for 1/7 is to be found in [10]. 


In 2008 Guillera [22] produced another lovely pair of third millennium identities—discovered with integer 
relation methods and proved with creative telescoping—this time for 7? rather than its reciprocal. They are 


oo a+ ae 3 9°. a 2 
(29) s: = eae (6(n+2)+1)=8e) > a ‘ 
and 
(30) > = “ — 2. (42(n+ a) +5) = 322 )> ere 


Here (a)n = a(a+1)--(a+n-— 1) is the rising factorial. Substituting « = 1/2 in (29) and (30), he obtained 
respectively the formulae 
ns 7 


co a foe) 
1 (4) fd 1 
yo Oe Gn ee and Sos ( 2 (2In +13) = —. 
92n (3); 4 96n—2 (2): 3 
2) n n=0 2/n 
8In this case it is easy to use Weyl’s criterion for equidistribution to establish this equivalence without mention of BBP numbers. 
*Guillera ascribes (28) to Gourevich, who used integer relation methods. I ‘rediscovered’ (28) using integer relation methods with 


30 digits. I then checked it to 500 places in 10 seconds, 1200 in 6.25 minutes, and 1500 in 25 minutes: with a naive command-line 
instruction in Maple on a light laptop. 
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15.2 Computational records 


The last decade has seen the record for computation of 7 broken in some very interesting ways. We have already 
described Kanada’s 2002 computation in Section 11 and noted that he also took advantage of the BBP formula 
of Section 13. This stood as a record until 2009 when it was broken three times—twice spectacularly. 


Daisuke Takahashi. The record for computation of 7 of under 29.37 million decimal digits, by Bailey in 
1986 had increased to over 2.649 trillion places by Takahashi in January 2009. Since the same algorithms were 
used for each computation, it is interesting to review the performance in each case: 


In 1986 it took 28 hours to compute 29.36 million digits on 1 cpu of the then new CRAY-2 at NASA Ames 
using (18). Confirmation using the quadratic algorithm (16) took 40 hours. (The computation uncovered 
hardware and software errors on the CRAY. Success required developing a speedup of the underlying FFT [14].) 
In comparison, on 1024 cores of a 2592 core Appro Xtreme-X3 system 2.649 trillion digits via (16) took 64 
hours 14 minutes with 6732 GB of main memory, and (18) took 73 hours 28 minutes with 6348 GB of main 
memory. (The two computations differed only in the last 139 places.) In April Takahashi upped his record to 
2,576,980,377,524 places. 


Fabrice Bellard. Near the end of 2009, Bellard computed nearly 2.7 trillion decimal digits of 7 (first in 
binary) using the Chudnovsky series (14). This took 131 days but he only used a single 4-core workstation with 
a lot of storage and even more human intelligence! For full details of this feat and of Takahashi’s most recent 
computation one can look at 

http://en.wikipedia. org/wiki/Chronology_of_computation_of_pi 


Nor is that the current end of the matter: 


Alexander Yee and Shigeru Kondo. In August 2010, they announced that they had used the Chudnovsky 
formula to compute 5 trillion digits of 7 over a 90-day period, mostly on a two-core Intel Xeon system with 
96 Gbyte of memory. They confirmed the result in two ways, using the BBP formula (as discussed above), 
which required 66 hours, and a variant of the BBP formula due to Bellard, which required 64 hours. Changing 
from binary to decimal required 8 days. This was upped to 10 trillion digits in October 2011. Full details are 
available at http: //www.numberworld.org/misc_runs/pi-5t/details.html. 


16._.... Life of 7 


Paul Churchland, writing about the sorry creationist battles of the Kansas school board, [19, Kindle ed, loc 
1589] observes that: 


Even mathematics would not be entirely safe. (Apparently, in the early 1900 s, one legislator in a 
southern state proposed a bill to rede ne the value of pi as 3.8 exactly, just to tidy things up.) 


As we have seen, the life of 7 captures a great deal of mathematics—algebraic, geometric and analytic, both 
pure and applied—along with some history and philosophy. It engages many of the greatest mathematicians 
and some quite interesting characters along the way. Among the saddest and least-well understood episodes 
was an abortive 1896 attempt in Indiana to legislate the value(s) of 7. The bill, reproduced in [9, p. 231-235], 
is is accurately described by David Singmaster, [27] and [9, p. 236-239]. 


At the end of the novel, Piscine (Pi) Molitor writes 
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I am a person who believes in form, in harmony of order. Where we can, we must give things a 
meaningful shape. For ecample I wonder could you tell my jumbled story in exactly one hundred 
chapters, not one more, not one less? Ill tell you, that s one thing I hate about my nickname, the 
way that number runs on forever. It s important in life to conclude things properly. Only then can 
you let go. 


We may well not share the sentiment, but we should celebrate that Pi knows 7 to be irrational. 


17 End Notes 


1. Why 7 is not 22/7. Today, even the computer algebra systems Maple or Mathematica ‘know’ this since 
1 4,4 
l-¢z 22 
(31) o< fs J ee rT, 
0 


though it would be prudent to ask ‘why’ each can perform the integral and ‘whether’ to trust it. Assuming we 
do trust it, then the integrand is strictly positive on (0,1), and the answer in (31) is an area and so strictly 
positive, despite claims that 7 is 22/7 ranging over millennia.!? In this case, requesting the indefinite integral 
provides immediate reassurance. We obtain 


tod 4 
a* (1-2) ly 2,5 4,3 
de = t = eP-—-t+4t-4 1 Ey 
| i¢e? 7 3 + 3 + arctan (t) , 


as differentiation easily confirms, and so the Newtonian fundamental theorem of calculus proves (31). 
One can take the idea in (31) a bit further, as in [14]. Note that 


1 
1 
32 et(l-a)‘dxr = —, 
(32) fate u)” dx 630 


and we observe that 


tp 1 (4 »)4q4 1 
(33) > | at(l—a)'dx < | (aay ae oe : a*(1—2)* de. 
0 0 


Combine this with (31) and (32) to derive: 223/71 < 22/7 — 1/630 < m < 22/7 — 1/1260 < 22/7, and so 
re-obtain Archimedes’ famous computation 
10 10 
34 38 <4 <3E. 
64) ane. (i 
The derivation above was first popularized in Eureka, a Cambridge student journal in 1971.11 A recent study 
of related approximations is [24]. (See also [14].) 


2. More about Gamma. One may define 


I(x) -{ t?-te~! dt 
0 


19One may still find adverts in newspapers offering such proofs for sale. A recent and otherwise very nice children’s book “Sir 
Cumference and the the Dragon of Pi (A Math Adventure)” published in (1999) repeats the error, and email often arrives in my 
in-box offering to show why this and things like this are true. 

11(31) was on a Sydney University examination paper in the early sixties and the earliest source I know of dates from the 1940’s 
[14]. 
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for Re « > 0. The starting point is that 
(35) «T(«) =T(«#+1), T(1) =1. 


In particular, for integer n, P(n +1) =n!. Also forO<a<1 


T 
Pe)Pd-2) =e, 

since for x > 0 we have 7 
T(x) = lim a 


n-voo []p_o(@ +k)” 
This is a nice consequence of the Bohr-Mollerup theorem [15, 14] which shows that I’ is the unique log-convex 


function on the positive half line satisfying (35). Hence, [(1/2) = \/z and equivalently we evaluate the Gaussian 
integral 


; e-® dx = Va, 


so central to probability theory. In the same vein, the improper sinc function integral evaluates as 


/ sin(2) Le 
“co £ 


Considerable information about the relationship between [ and 7 is to be found in [14, 21]. 


The Gamma function is as ubiquitous as 7. For example, it is shown in [18] that the expected length, W3, of 
a three-step unit-length random walk in the plane is given by 


3218 6 (1 97 27/35 f 2 
= ——_T® [= a fie Yi 
cy) Ws= 7678 (5) +73 @ 
We recall that ['(1/2)? = 7 and that similar algorithms exist for '(1/3),T'(1/4), and T'(1/6) [15, 14]. 


2. More about Complexity Reduction. To illustrate the stunning complexity reduction in the elliptic 
algorithms for Pi, let us explicitly provide a complete set of algebraic equations approximating 7 to well over a 
trillion digits. 


The number 7 is transcendental and the number 1/az29 computed next is algebraic; 
nonetheless they coincide for over 1.5 trillion places. 


Set ag = 6 — 4/2, yo = V2 —1 and then solve the system in Figure 17. 


This quartic algorithm, with the Salamin—Brent scheme, was first used by Bailey in 1986 [17] and was 
used repeatedly by Yasumasa Kanada (see Figure 12), in Tokyo in computations of 7 over 15 years or so, 
culminating in a 200 billion decimal digit computation in 1999. As recorded in Figure 11, it has been used 
twice very recently by Takahashi. Only thirty five years earlier in 1963, Dan Shanks—a very knowledgeable 
participant—was confident that computing a billion digits was forever impossible. Today it is ‘reasonably easy’ 
on a modest laptop. A fine self-contained study of this quartic algorithm—along with its cubic confrere also 
described in Section 10—can be read in [23]. The proofs are nicely refined specializations of those in [16]. 


3. Following 7 on the Web. One can now follow Pi on the web through Wikipedia, Math World or elsewhere, 
and indeed one may check the performance of 7 by looking up ‘Pi’ at http: //www.google.com/trends. This 
link shows very clear seasonal trends. with a large spike around Pi Day. The final spike (F) is for Tau Day 
(6.28)—a joke that many seem not have realized is a joke.!? 


12 waw.washingtonpost . com/blogs/blogpost/post/tau-day-replace-pi-make-music-with-tau/2011/06/28/AG6ub6oH_blog. html 
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1— ¥/1—yot P 2 1— V1— yi04 2 
ry eri a .a4 =a0(1+y1)*- 2 yi (IL t+y +m) Ui cal _ dar = ano (1+ yr) — 29 yur (L+ yur + ys”) 
1+ V1—-yot 1+ W1—- yi04 
1- Vi-yt a : 1 aa ; 
io = ay (Lae) = 2 yo (1+ y2 + yo") ya = yu 112 = ani (1+ yi2)* — 2° ya (1+ yo + yo”) 
14+ V1-mt 1+ VY1-yut 
1-4/1 -y!4 . 1- W/1—- yi2t 
SE Se eg a (Lge oye 1+y3 + ys") Hi cae = ae (ng) SP a l+yis + yi3’) 
1+ VY1—yt 1+ V/1-yat 
1— 4/i- yt ; 1- */i- st ’ 
Ya = ames Eee a4 = a3 (1+ ys)* - 2 ya l+ya + ys”) yi = ao VATS au = as (1+y4)* = 2 yg L+yia t+ ys’) 
1+ V1—ys* 1+ V1l-ms4 
4 1- V1l-yut 
4 oll 2 4 931 2 
Ys = a5 = a4 (1+ ys5)* —2"'ys (1 +45 + ys Yis = —— = ,, a5 = ata (1 + ys)” — 2°) ys (1+ ys + ys 
te! ) It Vint \ ) 
7 1- 4A yt 
,a5 = as (1+ y6)* — 2346 1+y6 +ys°) v6 = a te = as (1+ yis)* — 2° yr6 1+yi6 +e) 
— is 
az = ag (1+ yr) = 2? yr (1+ yr +977) pe ng met ee a ee *) 
,a7 = d6 7) — 2° yz 7+ Y7 17 = =, 17 = 16 17) — 2° yz u7 + yiz 
1+ V1—- yi6t 
4 17 2 1= 4/1 = yt 4 37, 2 
ag = a7 (1+ ys) —2°'ys (1+ ys + ys’) Wis = Tap ee = a (+ Hs) — 2°’ ys 1+yis + yis’) 
— y17* 
P 2 1- Y1—- yis4* 2 
, a9 = ag (1+ yo)* — 21° y5 1+y9 + yo") yi = —_ u ;@i9 = Gis (1+ y19)* — 2° y19 1+ yio + yo”) 
1+ V1-ms? 
i 2 1-W1- i 
ag (1+ yro)* — 27'y10 (1 + yo + yao’) yoo = 7 yao! VY, azo = aig (1+ y2o)* — 2** y20 (1+ y20 + y20”) 
1+ V1-y194 


Figure 17: The system of equations used to compute 7 to 1.5 trillion places 


4. The Difficulty of Popularizing Accurately. Let me finish on this theme. Even after many helpful 
comments from readers, errors probably remain in my article. So I tell the story below with no particular 
rancour. 


Paul Churchland in [19] offers a fascinating set of essays full of interesting anecdotes—which I have no 
particular reason to doubt. Nonetheless, the very brief quote at the start of Section 16, regarding the legislation 
of values of 7, contains four inaccuracies. As noted above: (i) the event took place in 1896/7 and (ii) in Indiana 
(a northern state); (iii) the prospective bill, #246, offered a geometric construction with inconsistent conclusions 
and certainly offers no one exact value. Finally, (iv) the intent seems to have been pecuniary, not hygienic [27]. 


As often, this makes me wonder whether mathematics popularization is especially prone to error or if the 
other disciplines just seem better described because of my relative ignorance. On April 1, 2009, an article 
entitled “The Changing Value of Pi” appeared in the New Scientist with an analysis of how the value of pi has 
been increasing over time. I hope but am not confident that all readers noted that April 1st is “April Fool’s 
day.” (See also entry seven of http://www .museumofhoaxes.com/hoax/aprilfool/.) 


Acknowledgements. Thanks are due to many, especially my close collaborators P. Borwein and D. Bailey. 
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Figure 18: Google’s trend line for ‘Pi’ 
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Paper 25: Jonathan M. Borwein, “I prefer pi: A brief history and anthology of 
articles in the American Mathematical Monthly,” American Mathematical Monthly, 
vol. 122 (2015), p. 195-216. Copyright 2015 Mathematical Association of America. 
All Rights Reserved. 


Synopsis: 

This paper, which appeared on Pi Day 2015 (3/14/15 in North American no- 
tation), presents a brief summary of papers that have appeared in the American 
Mathematical Monthly on the topic of 7. The most frequently cited papers are 
listed. 

The remainder of the article presents a historical overview of 7, ranging from a 
complete analysis of Archimedes’ technique for approximating 7 via inscribed and 
circumscribed polygons, arctangent-based formulas used in Renaissance times, one 
of the numerous proofs that 7 is irrational, the Salamin-Brent quadratically con- 
vergent algorithm for 7, the Borwein quartic algorithm, spigot algorithms, infinite 
product formulas, the curious behavior of erroneous digits in the Gregory series for 
am, and more. 


Keywords: Algorithms, Computation, History, Irrationality 
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I Prefer Pi: A Brief History and Anthology 
of Articles in the American Mathematical 
Monthly 


Jonathan M. Borwein and Scott T. Chapman 


Abstract. In celebration of both a special “big” a Day (3/14/15) and the 2015 centennial of 
the Mathematical Association of America, we review the illustrious history of the constant zr 
in the pages of the American Mathematical Monthly. 


1. INTRODUCTION. Once in a century, Pi Day is accurate not just to three digits 
but to five. The year the MAA was founded (1915) was such a year and so is the 
MAA’s centennial year (2015). To arrive at this auspicious conclusion, we consider 
the date to be given as month—-day-two-digit year.! This year, Pi Day turns 26. For a 
more detailed discussion of Pi and its history, we refer to last year’s article [46]. We 
do note that “I prefer pi” is a succinct palindrome.” 

In honor of this happy coincidence, we have gone back and selected roughly 76 
representative papers relating to Pi (the constant not the symbol) published in this 
journal since its inception in 1894 (which predates that of the MAA itself). Those 
76 papers listed in three periods (before 1945, 1945-1989, and 1990 on) form the core 
bibliography of this article. The first author and three undergraduate research students" 
ran a seminar in which they looked at the 76 papers. Here is what they discovered. 


Common themes. In each of the three periods, one observes both the commonality 
of topics and the changing style of presentation. We shall say more about this as we 
proceed. 


e We see authors of varying notoriety. Many are top-tier research mathematicians 
whose names remain known. Others once famous are unknown. Articles come from 
small colleges, Big Ten universities, Ivy League schools, and everywhere else. In 
earlier days, articles came from people at big industrial labs, but nowadays, those 
labs no longer support research as they used to. 


¢ These papers cover relatively few topics. 


° Every few years a “simple proof” of the irrationality of w is published. Such 
proofs can be found in [*58, 26, 29, 31, 39, 52, 59, 62, 76]. 

° Many proofs of ¢(2) := >>. 1/n? = 2*/6 appear, each trying to be a bit more 
slick or elementary than the last. Of course, whether you prefer your proofs con- 
cise and high tech or more leisurely and lower tech is a matter of taste and con- 
text. See [*38, *58, 20, 28, 34, 42, 57, 68, 69]. 

° Articles on mathematics outside the European tradition have appeared since the 
MONTHLY’s earliest days. See the papers [3, 9, 11, 15]. 


http://dx.doi.org/10.4169/amer.math.monthly. 122.03.195 
MSC: Primary 01A99, Secondary 11Z05 
'For advocates of t = 27, your big day 6/28/31 will come in 2031. 
?Given by the Professor in Yéko Ozawa, The Housekeeper and the Professor, Picador Books, 2003. Kindle 
location 1095, as is “‘a nut for a jar of tuna?” 
3The students are Elliot Catt from Newcastle and Ghislain McKay and Corey Sinnamon from Waterloo. 
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¢ In the past 30 years, computer algebra begins to enter the discussions — sometimes 
in a fundamental way. 

¢ Of course, the compositing style of the MONTHLY has changed several times. 

¢ The process of constructing this selection highlights how much our scholarly life 
has changed over the past 30 years. Much more can be found and studied easily, but 
there is even more to find than in previous periods. The ease of finding papers in 
Google Scholar has the perverse consequence — like Gresham’s law in economics — 
of making less easily accessible material even more likely to be ignored. 


While our list is not completely exhaustive, almost every paper listed in the bibli- 
ography has been cited in the literature. In fact, several have been highly cited. Some 
highly used research, such as Ivan Niven’s proof of the irrationality of 2 in 1947 is 
rarely cited as it has been fully absorbed into the literature [76]. Indeed, a quick look 
at the AMS’s Mathematical Reviews reveals only 15 citations of Niven’s paper. 

We deem as pi-star (or *) papers from our MONTHLY bibliography that have been 
cited in the literature more than 30 times. The existence of JSTOR means that most 
readers can access all these papers easily, but we have arranged for the z*s to be 
available free for the next year on our website (www.maa.org/amm_supplements). 
Here are the z*s with citation numbers according to Google Scholar (as of 1/7/2015). 
These papers are marked with a x in the regular bibliography. 


1. 133 citations: J. M. Borwein, P. B. Borwein, D. H. Bailey, Ramanujan, modular 
equations, and approximations to pi or how to compute one billion digits of pi, 
96(1989) 201-219. 


2. 119 citations: G. Almkvist, B. Berndt, Gauss, Landen, Ramanujan, the arithmetic- 
geometric mean, ellipses, 7, and the ladies diary, 95(1988) 585-608. 


3. 73 citations: A. Kufner, L. Maligrand, The prehistory of the Hardy inequality, 
113(2006) 715-732. 

4. 63 citations: J. M. Borwein, P. B. Borwein, K. Dilcher, Pi, Euler numbers, and 
asymptotic expansions, 96(1989) 681-687. 

5. 56 citations: N. D. Baruah, B. C. Berndt, H. H. Chan, Ramanujan’s series for 
1/z: a survey, 116(2009) 567-587. 


6. 40 citations: J. Sondow, Double integrals for Euler’s constant and In z/4 and an 
analog of Hadjicostas’s formula, 112(2005) 61-65. 

7. 39 citations: D. H. Lehmer, On arccotangent relations for zr, 45(1938) 657-664. 

8. 39 citations: I. Papadimitriou, A simple proof of the formula °°, 1/k? = 27/6, 
80(1973) 424-425. 


9. 36 citations: V. Adamchik, S. Wagon, A simple formula for 7, 104(1997) 852- 
855. 


10. 35 citations: D. Huylebrouck, Similarities in irrationality proofs for 7, In 2, ¢(2), 
and &(3), 108(2001) 222-231. 


11. 35 citations: L. J. Lange, An elegant continued fraction for 7, 106(1999) 456— 
458. 


12. 33 citations: S. Rabinowitz, S. Wagon, A spigot algorithm for the digits of z, 
102(1995) 195-203. 


13. 32 citations: W. S. Brown, Rational exponential expressions and a conjecture 
concerning z and e, 76(1969) 28-34. 


The remainder of this article. We begin with a very brief history of Pi, both math- 
ematical and algorithmic, which can be followed in more detail in [80] and [46]. We 
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then turn to our three periods and make a very few extra comments about some of 
the articles. For the most part the title of each article is a pretty good abstract. We 
then make a few summatory remarks and list a handful of references from outside 
the MONTHLY, such as David Blattner’s Joy of Pi [79] and Arndt and Haenel’s Pi 
Unleashed [78]. 


2. PI: A BRIEF HISTORY. Pi is arguably the most resilient of mathematical ob- 
jects. It has been studied seriously over many millennia and by every major culture, 
remaining as intensely examined today as in the Syracuse of Archimedes’ time. Its role 
in popular culture was described in last year’s Pi Day article [46]. We also recall the 
recent movies Life of Pi ((2012, PG) directed by Ang Lee) and Pi ((1998, R) directed 
by Darren Aronofsky)*. 

From both an analytic and computational viewpoint, it makes sense to begin with 
Archimedes. Around 250 BCE, Archimedes of Syracuse (287-212 BCE) is thought to 
have been the first ((n Measurement of the Circle) to show that the “two possible Pi’s” 
are the same. For a circle of radius r and diameter d, Area= 1, r? while Perimeter 
= md but that 2, = z> is not obvious and is often overlooked; see [55]. 


Archimedes’ method. The first rigorous mathematical calculation of 2 was also due 
to Archimedes, who used a brilliant scheme based on doubling inscribed and circum- 
scribed polygons, 


6h 12 24r 48R 96, 


and computing the perimeters to obtain the bounds 3 ST 38 =....° The case 
of 6-gons and 12-gons is shown in Figure 1; for n = 48 one already “sees” near- 
circles. No computational mathematics approached this level of rigor again until the 
19th century. Phillips in [41] or [80, pp. 15-19] calls Archimedes the “first numerical 
analyst.” 


Figure 1. Archimedes’ method of computing z with 6- and 12-gons 


Archimedes’ scheme constitutes the first true algorithm for z in that it can produce 
an arbitrarily accurate value for z. It also represents the birth of numerical and error 
analysis — all without positional notation or modern trigonometry. As discovered in the 
19th century, this scheme can be stated as a simple, numerically stable, recursion, as 
follows [82]. 


4Tmagine, an R-rated movie involving Pi! 
All rules are meant to be broken. Writing 10/70 without cancellation makes it easier to see that 1/7 is 
larger than 10/71. 
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Archimedean mean iteration (Pfaff-Borchardt-Schwab). Set ay = 2/3 and by = 
3, which are the values for circumscribed and inscribed 6-gons. If 
2an bn 


a? (H) and Dn+i = An41bn (G), (1) 


Qn41 = 


then a, and b, converge to 7, with the error decreasing by a factor of four with each 
iteration. In this case, the error is easy to estimate—look at a7, , — b7,,;—and the limit 
is somewhat less accessible but still reasonably easy to determine [82]. 

Variations of Archimedes’ geometrical scheme were the basis for all high-accuracy 
calculations of z over the next 1,800 years—far after its “best before” date. For ex- 
ample, in fifth century China, Tsu Chung-Chih used a variant of this method to obtain 
i correct to seven digits. A millennium later, al-Kashi in Samarkand “who could cal- 


culate as eagles can fly” obtained 27 in sexadecimal: 


6. 3 2 a, , 3 46 14. 50 
60° 609’ 


good to 16 decimal places (using 3 - 278-gons). This is a personal favorite; reentering 
it in a computer centuries later and getting the predicted answer gives the authors 
horripilation (“goose-bumps’”’). 

Pi’s centrality is emphasised by the many ways it turns up early in new subjects 
from irrationality theory to probability and harmonic analysis. For instance, Francois 
Viéta’s (1540-1603) formula 


2. V2V24+V2¥24+V24+ v2 
=~5 5 = 


4 


(2) 
and John Wallis’ (1616-1703) infinite product [67, 74, 75] 


m  2:2-4-4-6-6-8-8 
2° 1-3-3-5-5-7-7°9 


(3) 


are counted among the first infinitary objects in mathematics. The latter leads to the 
gamma function, Stirling’s formula, and much more [64], including the first infinite 
continued fraction® for 2/1 by Lord Brouncker (1620-1684), first president of the 
Royal Society of London: 


2 1 9 25 49 


= ; (4) 
xm 1+2+242--- 


Here, we use the modern concise notation for a continued fraction. 


Arctangents and Machin formulas. With the development of calculus, it became 
possible to extend calculations of z dramatically as shown in Figure 4. Almost all 
calculations between 1700 and 1980 reduce to exploiting the series for the arctangent 
(or another inverse trig function) and using identities to require computation only near 
the center of the interval of convergence. Thus, one starts with 


arctan(x) = x 3 5 7 t--. for -l<x<1l (5) 


6This was discovered without proof as was (3). 
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and arctan(1) = 2/4. Substituting x = | proves the Gregory—Leibniz formula (1671- 
1674) 


James Gregory (1638-1675) was the greatest of a large Scottish mathematical fam- 
ily. The point x = 1, however, is on the boundary of the interval of convergence of 
the series. Justifying substitution requires a careful error estimate for the remainder or 
Lebesgue’s monotone convergence theorem, but most introductory calculus texts ig- 
nore the issue. The arctan integral and series were known centuries earlier to the Kerala 
school, which was identified with Madhava (c. 1350 —c. 1425) of Sangamagrama near 
Kerala, India. Madhava may well have computed 13 digits of zr. 
To make (5) computationally feasible, we can use one of many formulas such as: 


arctan(1) = 2 arctan (;) + arctan (5) (Hutton) (7) 
1 1 1 
arctan(1) = arctan (5) + arctan (=) + arctan (5) (Euler) (8) 
tan(1) = 4 arct : t : Machin) (9) 
arctan(1) = 4 arctan 5 arctan 739 (Machin). 


All of this, including the efficiency of different Machin formulas as they are now 
called, is lucidly described by the early and distinguished computational number the- 
orist D.H. Lehmer [*13]. See also [2, 5, 49] and [19] by Wrench, who in 1961 with 
Dan Shanks performed extended computer computation of z using these formulas; see 
Figure 5. 

In [*13] Lehmer gives what he considered to be a best possible self-checking pair 
of arctan relations for computing z. The pair was 


1 1 1 
arctan (1) = 8 arctan (=) — arctan (<5) — 4arctan (=) (10) 


1 1 1 
arctan (1) = 12 arctan (=) + 8 arctan (=) — Sarctan (xs) . (11) 


In [2], Ballantine shows that this pair makes a good choice since the series for 
arctan(1/18) and arctan(1/57) has terms that differ by a constant factor of “0,” a 
decimal shift. This observation was implemented in both the 1961 and 1973 computa- 
tions listed in Figure 4. 


Mathematical landmarks in the life of Pi. The irrationality of 7 was first shown by 
Lambert in 1761 using continued fractions [*63]. This is a good idea since a number a 
has an eventually repeating nonterminating simple continued fraction if and only if a 
is a quadratic irrational, as made rigorous in 1794 by Legendre. Legendre conjectured 
that 7 is nonalgebraic’, that is, that 2 is transcendental. Unfortunately, all the pretty 
continued fractions for z are not simple [*63, 70, 83]. In [*63], Lange examines various 
proofs of 


7It can be argued that he was anticipated by Maimonides (the Rambam, 1135-1204) [81]. 
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rm 3 5S 7 
nw =34 . (12) 
2+2+2+4+2.-- 


Legendre was validated when in 1882 Lindemann proved z transcendental. He did 
this by extending Hermite’s 1873 proof of the transcendence of e. There followed a 
spate of simplifications by Weierstrass in 1885, Hilbert in 1893, and many others. Os- 
wald Veblen’s article [18], written only ten years later, is a lucid description of the 
topic by one of the leaders of the early 20th century American mathematical commu- 
nity.’ A 1939 proof of the transcendence of z by Ivan Niven [14] is reproduced exactly 
in Appendix A since it remains entirely appropriate for a class today. 

We next reproduce our personal favorite MONTHLY proof of the irrationality of z. 
All such proofs eventually arrive at a putative integer that must lie strictly between 
zero and one. 


Theorem 1 (Breusch [26]). z is irrational. 


Proof. Assume x = a/b with a and b integers. Then, with N = 2a, sinN = 0, 
cos N = 1, and cos(N/2) = +1. If m is zero or a positive integer, then 


2k+1 


An(x) = a 1)k(2k +1)” ae yy = Puls) cost + On (x) sin 


where P,,(x) and Q,,(x) are polynomials in x with integral coefficients. (The proof 
follows by induction on m : Ajn4+) = xdA,»,/dx, and Ap = sinx.) Thus, A,,(V) is an 
integer for every positive integer m. 

If t is any positive integer, then 


= (Qk+1—t—1)2Qk+1—t—2)---(Qk+1—2t)_, 
B, = 1)* N2K+1 
(N) oy ) (2k + 1)! 
= (2k +1)! — b\(2k +1)! 4+---4b, 
= 1)* 2k+1 
x ) (2k +1)! 


= A,(N) — bi A,-1(N) + +++ +b, Ao(N). 


Since all the b; are integers, B,(N) must be an integer too. Break the sum for B,(NV) 
into the three pieces 


[¢-1)/2] t=1 


ys Jando. 


k=0 k=[(t+1)/2] 


In the first sum, the numerator of each fraction is a product of t consecutive integers; 
therefore, it is divisible by t! and hence by (2k + 1)! since 2k + 1 < t. Thus, each term 
of the first sum is an integer. Each term of the second sum is zero. Thus, the third sum 
must be an integer for every positive integer f. 


8He was also nephew of Thorstein Veblen, one of the founders of sociology and originator of the term 
“conspicuous consumption.” 
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This third sum is 
oX 1)* (2k = t)! Nee 
= (2k + 1)!(2k — 2t)! 
~ (1! t! wet (4 (t+1¢+2) N? 
(2t+ 1)! (2t + 2)(2t + 3) 2! 
 @+DC4+2)¢4+3)¢+4) Nt 
' (2t + 2)(2t + 3)(2t + 4)(2t + 5) 4! 
Let S(t) stand for the sum in the parenthesis. Certainly 
N? 7 
|S(t)| < Pee Tagg rt =e : 
Thus, the whole expression is absolutely less than 
t! Nitti eN nen eM < (N2/t)'*te% 
(2t+ 1)! ttl ; 
which is less than 1 for tf > fo. 
Therefore, S(t) = 0 for every integer t > fo. But this is impossible because 
lim S(t) = 1 Le ee a N/2)=+1 | 
Bata ay? on ae 


A similar argument shows that the natural logarithm of a rational number must be 
irrational. From log(a/b) = c/d would follow that e° = a/b’ = A/B. Then 


.(k—t—1)(k-t Zyverth = 20) 
k! 


Bs 


k=0 


would have to be an integer for every positive integer t, which leads to a contradiction. 

Irrationality measures, denoted j1(@), as described in [83] seem not to have seen 
much attention in the MONTHLY. The irrationality measure of a real number is the 
infimum over jz > O such that the inequality 


1 
qi 


has at most finitely many solutions in p € Z and q € N. Currently, the best irra- 
tionality measure known for z is 7.6063. For 2, it is 5.095412, and for log 2, it is 
3.57455391. For every rational number, the irrationality measure is 1 and the Thue- 
Siegel-Roth theorem states that if @ is a real algebraic irrational then jz(a@) = 2. Indeed, 
almost all real numbers have an irrationality measure of 2, and transcendental numbers 
have irrationality measure 2 or greater. For example, the transcendental number e has 
(e) = 2 while Liouville numbers such as )~,,.. 1/10"' are precisely those numbers 
having infinite irrationality measure. The fact that w(7) < co (equivalently z is not 


March 2015] PI IN THE MONTHLY 201 


483 


484 25. I PREFER PI: A BRIEF HISTORY OF ARTICLES IN THE MONTHLY (2015) 


a Liouville number) was first proved by Mahler [85] in 1953.? This fact does figure 
in the solution of many MONTHLY problems over the years; for instance, it lets one 
estimate how far sin(7) is from zero. 

The Riemann zeta function"® is defined for s > 1 by ¢(s) = >>. 1/n’. The Basel 
problem, first posed by Pietro Mengoli in 1644, which asked for the evaluation of 
€(2) = >-,., 1/n’, was popularized by the Bernoullis, who came from Basel in 
Switzerland and, hence, the name. In 1735, all even values of ¢ were evaluated by 
Euler. He argued that sin(z x) could be thought of as an infinite polynomial and so 


2 


sin(z x) _ = x 
MY <a] (1-4). (13) 


n=1 


since both sides have the same zeros and value at zero. Comparing the coefficients of 
the Taylor series of both sides of (13) establishes that ¢(2) = 27/6 and then one recur- 
sively can determine a closed form (involving Bernoulli polynomials). In particular, 
¢(4) = 24/90, €(6) = 1°/945, and ¢(8) = 2° /9450 and so on. By contrast, ¢(3) was 
only proven irrational in the late 1970s, and the status of ¢(5) is unsettled—although 
every one who has thought about this knows it is irrational. It is a nice exercise to con- 
firm the values of ¢(4), ¢(6) from (13). A large number of the papers in this collection 
center on the Basel problem and its extensions; see [*58, *73, 50, 72]. An especially 
nice accounting is in [43]. As is discussed in [*24, 46], it is striking how little more is 
known about the number—theoretic structure of z. 


Algorithmic high spots in the life of Pi. In the large, only three methods have been 
used to make significant computations of zr: before 1700 by Archimedes’ method, 
between 1700 and 1980 using calculus methods (usually based on the arctangent’s 
Maclaurin series and Machin formulas), and since 1980 using spectacular series or 
iterations both based on elliptic integrals and the arithmetic-geometric mean. The 
progress of this multicentury project is shown in Figures 2, 4, and 5. If plotted on 
a log linear scale, the records line up well, especially in Figure 5, which neatly tracks 
Moore’s law. 


Name Year Digits 
Babylonians 2000? BCE 1 
Egyptians 2000? BCE 1 
Hebrews (1 Kings 7:23) 550? BCE 1 
Archimedes 250? BCE 3 
Ptolemy 150 3 
Liu Hui 263 5 
Tsu Ch’ung Chi 480? 7 
Al-Kashi 1429 14 
Romanus 1593 15 
van Ceulen (Ludolph’s number) 1615 35 


Figure 2. Pre-calculus z calculations 


°He showed ju(t) < 42. Douglas Adams would be pleased. The entire Mahler archive is on line at http: // 
carma.newcastle.edu.au/mahler/. 
!0 As expressed in Stigler’s law of eponymy, discoveries are often named after later researchers, but in Euler’s 
case, he needs no more glory. 
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Hex Digit Occurrences 
0 62499881108 
Decimal Digit Occurrences i 62500212206 
2 62499924780 
0 99999485134 3 62500188844 
1 99999945664 4 62499807368 
2 100000480057 5 62500007205 
3 99999787805 6 62499925426 
4 100000357857 7 62499878794 
5 99999671008 8 62500216752 
6 99999807503 9 62500120671 
7 99999818723 A 62500266095 
8 100000791469 B 62499955595 
9 99999854780 C 62500188610 
D 62499613666 
Total 1000000000000 E 62499875079 
F 62499937801 
Total 1000000000000 


Figure 3. Seemingly random behavior of single digits of z in base 10 and 16 


Name Year | Correct Digits 
Sharp (and Halley) 1699 71 
Machin 1706 100 
Strassnitzky and Dase 1844 200 
Rutherford 1853 440 
Shanks 1874 (707) 527 
Ferguson (Calculator) 1947 808 
Reitwiesner et al. (ENIAC) | 1949 2,037 
Genuys 1958 10,000 
Shanks and Wrench 1961 100,265 
Guilloud and Bouyer 1973 1,001,250 


Figure 4. Calculus z calculations 


The “post-calculus” era was made possible by the simultaneous discovery by Eu- 
gene Salamin and Richard Brent in 1976 of identities—actually known to Gauss but 
not recognized for their value [*24, 37, 82]—that lead to the following two illustrative 
reduced complexity algorithms. 


Quadratic algorithm (Salamin-Brent). Set ao = 1, bp = 1/ V2, and sy = 1/2. Cal- 
culate 


pt bye 
a= at (Arithmetic), Dy = ay—1bg,_-, (Geometric), (14) 
2 
— 72 2 = k _ k 
Ce =a — DE, Sk = Sey — 2% cy ~andcompute py = —. (15) 
Sk 


Then p, converges quadratically to x. Note the similarity between the arithmetic— 
geometric mean iteration (14) (which for general initial values converges quickly to 
a nonelementary limit) and the out-of-kilter harmonic—geometric mean iteration (1) 
(which in general converges slowly to an elementary limit) and which is an arithmetic— 
geometric iteration in the reciprocals (see [82]). 
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Name Year Correct Digits 
Miyoshi and Kanada 1981 2,000,036 
Kanada-Yoshino-Tamura 1982 16,777,206 
Gosper 1985 17,526,200 
Bailey Jan. 1986 29,360,111 
Kanada and Tamura Sep. 1986 33,554,414 
Kanada and Tamura Oct. 1986 67,108,839 
Kanada et. al Jan. 1987 134,217,700 
Kanada and Tamura Jan. 1988 201,326,551 
Chudnovskys May 1989 480,000,000 
Kanada and Tamura Jul. 1989 536,870,898 
Kanada and Tamura Nov. 1989 1,073,741,799 
Chudnovskys Aug. 1991 2,260,000,000 
Chudnovskys May 1994 4,044,000,000 


Kanada and Takahashi Oct. 1995 6,442,450,938 
Kanada and Takahashi Jul. 1997 51,539,600,000 
Kanada and Takahashi Sep. 1999 206,158,430,000 
Kanada-Ushiro-Kuroda Dec. 2002 1,241,100,000,000 


Takahashi Jan. 2009 1,649,000,000,000 
Takahashi April. 2009 | 2,576,980,377,524 
Bellard Dec. 2009 | 2,699,999,990,000 
Kondo and Yee Aug. 2010 5,000,000,000,000 
Kondo and Yee Oct. 2011 10,000,000,000,000 
Kondo and Yee Dec. 2013 | 12,200,000,000,000 


Figure 5. Post-calculus z calculations 


Each iteration of the Brent-Salamin algorithm doubles the correct digits. Successive 
iterations produce 1, 4, 9, 20, 42, 85, 173, 347, and 697 good decimal digits of 2, and 
take log N operations to compute N digits. Twenty-five iterations compute zr to over 
45 million decimal digit accuracy. A disadvantage is that each of these iterations must 
be performed to the precision of the final result. Likewise, we have the following. 


Quartic Algorithm (The Borweins). Set ag = 6 — 4./2 and y= J/2 — 1. Iterate 


t=(= yp 


pPk+3 
i+ —yp' 


Yeu = and ayy) = ax(1 + yey)? — yesh + yest + Yeu): 


Then 1|/a, converges quartically!' to m. Note that only the power of 2 used in a, 
depends on k. Twenty-five iterations yield an algebraic number that agrees with z to 
in excess of a quadrillion digits. This iteration is nicely derived in [56]. 

As charmingly detailed in [*21], see also [*47, 82], Ramanujan discovered that 


(16) 


1 2/2 3 (4k)! (1103 + 26390k) 
x 9801 & (k1)4396% 

Each term of this series produces an additional eight correct digits in the result. When 

Gosper used this formula to compute 17 million digits of z in 1985, it agreed to many 

millions of places with the prior estimates, this concluded the first proof of (16). As 

described in [*24], this computation can be shown to be exact enough to constitute a 

bona fide proof! Actually, Gosper first computed the simple continued fraction for zr, 


'lA fourth-order iteration might be a compound of two second-order ones; this one cannot be so decom- 
posed. 
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hoping to discover some new things in its expansion, but found none. At the time of 
this writing, 500 million terms of the continued fraction for 7 have been computed 
by Neil Bickford (then a teenager) without shedding light on whether the sequence is 
unbounded (see [77]). 

G. N. Watson, on looking at various of Ramanujan’s formulas such as (16), reports 
the following sensations [86]: 


.a thrill which is indistinguishable from the thrill I feel when I enter the Sagrestia Nuovo 
of the Capella Medici and see before me the austere beauty of the four statues representing 
‘Day’, ‘Night’, ‘Evening’, and ‘Dawn’ which Michelangelo has set over the tomb of Guiliano 
de‘Medici and Lorenzo de‘Medici. — G. N. Watson, 1886-1965. 


Soon after Gosper did his computation, David and Gregory Chudnovsky found the 
following even more rapidly convergent variation of Ramanujan’s formula. It is a con- 
sequence of the fact that ./— 163 corresponds to an imaginary quadratic field with class 
number one: 


; = (—1) (6K)! (13591409 + 545140134k) 
a) | . 
x dX (3k)! (k!)3 6403203+3/2 (17) 


Each term of this series produces an extraordinary additional 14 correct digits. Note 
that in both (16) and (17), one computes a rational series and has a single multiplication 
by a surd to compute at the end. 


Some less familiar themes. While most of the articles in our collection fit into one of 
the big themes (irrationality [57], transcendence, arctangent formulas, Euler’s product 
for sinx, evaluation of ¢(2), z in other cultures), there are of course some lovely 
sporadic examples. These include the following. 


¢ Spigot algorithms, which drip off one more digit at a time for 7 and use only 
integer arithmetic [*71, 54]. As described in [*44], the first spigot algorithm was 
discovered for e. While the ideas are simple, the specifics for 7 need some care; we 
refer to Rabinowitz and Wagon [*71] for the carefully explained details. 


* Products for z - e and m/e [35]. Melzack, then at Bell Labs, proved’? that 


e 2N 2 (—1"tn 

—= lim 1+- 18 
2e Warcod | ( : ? : 
6 2N+1 2 (-D"n 

= iin (: f ~) (19) 
we Noo 45 n 


Melzak begins by showing that lim,... V(C,)/V (Sn) = /2/(e), where S,, is the 
n-sphere and C,, is the inscribed n-dimensional cylinder of greatest volume. He then 
proves (18) and (19), saying the proof follows that of the derivation of Wallis’ for- 
mula, and he conjectures that (18) can be used to prove that e/z is irrational. We re- 
mind the reader that the transcendentality of e” follows from the Gelfond—Schneider 
theorem (1934) [82] since e7/? = i~', but the statuses of e + 2, e/m,e-7, and 1° 
are unsettled. 


!2We correct errors in Melzak’s original formulas. 
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Both (18) and (19) are very slowly convergent. To check (19), one may take logs 


and expand the series for log then exchange the order of summation to arrive at the 
more rapidly convergent “zeta’’-series 


>? feria i) 1) = log (=) 


where a(s) := )¢ pso(—-D* /(k + 1)° is the alternating zeta function, which is well 
defined for Res > 0. 
If we consider the partial products for (18), then we obtain 


2244 6 6 8 8 2N 2N+1\% 
13355 779 2WN+1 2N+2 , 
As N +> on, the left factor yields Wallis’s product for 2/2 and the right factor tends 
to 1/e, which confirms (18). A similar partial product can be obtained from (19). 


A curious predictability in the error in the Gregory—Liebnitz series (6) for 7/4 
[*25, 45]. In 1988, it was observed that the series 


OO ¢_1)kt1 


when truncated to 5,000,000 terms, differs strangely from the true value of zr: 


3.14159245358979323846464338327950278419716939938730582097494182230781640... 
3.14159265358979323846264338327950288419716939937510582097494459230781640... 
2 -2 10 -122 2770. 


Values differ as expected from truncating an alternating series: in the seventh place 
a “4” that should be a “6.” But the next 13 digits are correct and, after another blip, 
for 12 digits. Of the first 46 digits, only four differ from the corresponding digits 
of zr. Further, the “error” digits seemingly occur with a period of 14. Such anoma- 
lous behavior begs for explanation. A great place to start is by using Neil Sloane’s 
Internet-based integer sequence recognition tool, available at www.oeis.org. This 
tool has no difficulty recognizing the sequence of errors as twice the Euler num- 
bers. Even Euler numbers are generated by sec x = )\¢2.)(—1)* Ex,.x7*/(2k)!. The 
first few are 1, —1,5, —61, 1385, —50521, 2702765. This discovery led to the fol- 
lowing asymptotic expansion: 


lo @) 


ua (—1)*! Bes Eom 
: 2 %k—-1 ~ » N2mtl? (21) 


k=1 m=0 


Now the genesis of the anomaly is clear: by chance, the series had been trun- 
cated at 5,000,000 terms—exactly one-half of a fairly large power of ten. Indeed, 
setting N = 10, 000, 000 in equation (21) shows that the first hundred or so digits of 
the truncated series value are small perturbations of the correct decimal expansion 
for 7. 

On a hexadecimal computer with N = 16’, the corresponding strings and hex 
errors are 
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3.243F6A8885A308D313198A2E03707344A4093822299F31D0082EFA98EC4E6C89452821E... 
3.243F6A6885A308D31319AA2E03707344A3693822299F31D7A82EFA98EC4DBF69452821E... 
2 =2 A -7TA 2AD2 


with the first being the correct value of zr. (In hexadecimal or hex one uses “A,B, 
..., F’ to write 10 through 15 as single “hex-digits.”) Similar phenomena occur for 
other constants; see [80]. Also, knowing the errors means we can correct them and 
use (21) to make Gregory’s formula computationally tractable. 


¢ Hilbert’s inequality [*61, 48] In its simplest incarnation, Hilbert’s inequality is 


be (for an, bin € R, an, bn > 0) (22) 


n 


myn=1 


with the assertion that the constant z is best possible. Actually, 27 was the best 
constant that Hilbert could obtain. Hardy’s inequality, which originated in his suc- 
cessful attempt to prove (22) early in the development of the modern theory of 
inequalities, is well described in [*61]. One could write a nice book on the places in 
which z or ¢(2) arise as the best possible constant in an inequality. 

¢ The distribution of the digits of 2 [46]. Single-digit distribution of the first tril- 
lion digits base 10 and 16 is shown in Figure 3. All the counts in these figures are 
consistent with 2 being random. 


3. PIIN THIS MONTHLY: 1894-1944. This period yielded 20 papers for our se- 
lection. The July 1894 issue of this MONTHLY contained the most embarrassing article 
on Pi [10] ever to grace the pages of the MONTHLY. Flagged only by “published by the 
request of the author,’ who indicated it was copyrighted in 1889, it is the origin of the 
famous usually garbled story of the attempt by Indiana in 1897 to legislate the value of 
zc; see [81] and [80, D. Singmaster, The legal values of pi]. It contains a nonsensical 
geometric construction of z. So a and the MONTHLY got off on a bad footing. 

Luckily, the future was brighter. While most early articles would meet today’s crite- 
ria for publication, this is not true of all. For example, [20] offers a carefully organized 
list of 68 consequences of Euler’s product for sin given in (13) with almost no English. 
By contrast, [6] is perhaps the first discussion of the efficiency of calculation in the 
MONTHLY. 
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1. R.C. Archibald, Historical notes on the relation e~*/2) = i', Amer. Math. Monthly 28(1921) 116-121. 
MR1519723 
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4. PIIN THIS MONTHLY: 1945-1989. This second period collects 22 papers. It 
saw the birth and evolution of the digital computer with many consequences for the 
computation of z. Even old topics are new when new ideas and tools arise. A charming 
example is as follows. 


Why z is not 22/7. Did you know that 


nm? (23) 


. i—4 
0 14+ x2 7 


The integrand is strictly positive on (0, 1), so the integral in (23) is strictly positive— 
despite claims that 2 is 22/7 that rage over the millennia.!? Why is this identity true? 
We have 


‘x =a)" 1 2 4 
[- =e) dx = =t! P+r r+4t—4 arctan(t), 
0 14+ x? 7 3 3 


as differentiation easily confirms, and so the Newtonian fundamental theorem of cal- 
culus proves (23). 
One can take the idea in (23) a bit further. Note that 


i Waa ae (24) 
x*(1—x)" dx = — 
0 630’ 


and we observe that 


1 1 1 _ 4,4 1 
5 f a-ntdx< | Sa ds < f tax tar, (25) 


Combine this with (23) and (24) to derive 
223 22 1 22 1 22 


—_—-—<—, 


—< -=— << 
71 7 630 7 1260 7 


and so we re-obtain Archimedes’ famous computation 


10 
3— <n <3. (26) 


'30Qne may still find adverts in newspapers offering such proofs for sale. A recent and otherwise very nice 
children’s book “Sir Cumference and the the Dragon of Pi (A Math Adventure)” published in 1999 repeats the 
error, and email often arrives in our in-boxes offering to show why things like this are true. 
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This derivation was popularized in Eureka, a Cambridge University student journal, 

in 1971.'4 A recent study of related approximations is made by Lucas [65]. It seems 

largely happenstance that 22/7 is an early continued fraction approximate to zr. 
Another less standard offering is in [33] where Y. V. Matiyasevich shows that 


6logfem(F, ..., Fin) 


im ; 
m>ooV logilem(uy,..., Um) 


(27) 


Here, Icm is the least common multiple, fem is the formal common multiple (the 
product), and F,, is the n-th Fibonacci number with Fo = 0, F; = 1, F, = Fy-1 + 
F,,-2,n => 2 (without the square root we obtain a formula for ¢(2)). 
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5. PIIN THIS MONTHLY: 1990-2015. In the final period, we have collected 32 
papers and see no sign that interest in w is lessening. A new topic [*44, 46, 51, 81] is 
that of BBP formulas, which can compute individual digits of certain constants such 
as mr in base 2 or x” in bases 2 and 3 without using the earlier digits. The phenomenon 


'4Equation (23) was on a Sydney University examination paper in the early sixties and the earliest source 
we know of dates from the 1940s [65] in an article by Dalzell, who lamentably did not cite himself in [84]. 
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is based on the formula 


foe) 


3 1 ( 4 2 1 1 ) (28) 
~ - 16 \8i+1 8i+4 81+5 8i+6)- 


i= 


On August 27, 2012, Ed Karrel used (28) to extract 25 hex digits of m starting after 
the 10'> position. They are 353CB3F7FOC9ACCFA9AA215F2.'> In 1990, a billion digits 
had not yet been computed; see [80], and even now, it is inconceivable to compute the 
full first quadrillion digits in any base. 

Over this period, the use of the computer has become more routine even in pure 
mathematics, and concrete mathematics is back in fashion. In this spirit, we record the 
following evaluation of ¢(2), which to our knowledge first appeared as an exercise in 
[82]. 


Theorem 2 (Sophomore’s Dream). One may square term-wise to obtain 


2 
oo (—1)" a loo) l 
(> a] - Ds (2n + 1)?" 2) 


n=—Ooo n=—Co 


In particular ¢(2) = x7 /6. 


Proof. Let 
NN 
(-—1)"™" 1 
pap» yar 
n=—N m=—N (2m ou I)(2n + 1) kon (2k + 1) 
and note that dy = 7 y i ye ae We leave it to the reader to show 


that for large N the inner sum €y(7) is of order 1/(N — n+ 1), which goes to zero. 
The proof is finished by evaluating the left side of (29) to 17/4 using Gregory’s 
formula (6) and then noting that this means }°~ , 1/(2n + 1)? = 17/8. | 


Another potent and concrete way to establish an identity is to obtain an appropriate 
differential equation. For example, consider 
" exp(—x?(1 + #?)) 


f(x) — (/ a as) and g(x) =| — {age 8 


The derivative of f + g is zero: in Maple, 


2 


f:=x->Int (exp(-s*2) ,s=0..x)72; 

g:=x->Int (exp (-x72*(1+t72))/(1+t7*2) ,t=0..1); 

with (student) :d:=changevar (s=x*t , diff (f(x) ,x) ,t)+diff (g(x) ,x); 
d:=expand(d) ; 


'S All processing was done on four NVIDIA GTX 690 graphics cards (GPUs) installed in CUDA; the com- 
putation took 37 days. CUDA is a parallel computing platform and programming mode developed by NVIDIA 
for use in its graphics processing units (GPUs). 
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shows this. Hence, f(x) + g(x) is constant for 0 < x < o and so, after justifying 
taking the limit at oo, 
es) 2 7 
(/ exp(—r)at) = f (co) = g(0) = arctan(1) = a 
0 


Thus, we have evaluated the Gaussian integral using only elementary calculus and 
Gregory’s formula (6). The change of variables t? = x shows that this evaluation of 
the normal distribution agrees with [(1/2) = ./z. 

In similar fashion, we may evaluate 


FQ) := [ exp(—x’) cos(2xy) dx 
0 


by checking that it satisfies the differential equation F’(y) + 2y F(y) = 0. We obtain 


y= va exp(—y’), 


since we have just evaluated F(0) = ./7/2. 
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6. CONCLUDING REMARKS. 


It’s generally the way with progress that it looks much greater than it really is. — Ludwig 
Wittgenstein'® 


It is a great strength of mathematics that “old” and “inferior” are not synonyms. As we 
have seen in this selection, many seeming novelties are actually rediscoveries. That is 
not at all a bad thing, but it does behoove authors to write “I have not seen this before” 
or “this is to my knowledge new” rather than unnecessarily claiming ontological or 
epistemological primacy. 


ACKNOWLEDGMENT. The authors thank Ivars Peterson for his help in assembling our bibliography. They 
also thank three undergraduate research students, Elliot Catt from Newcastle and Ghislain McKay and Corey 
Sinnamon from Waterloo. Last but not least, they thank two former MONTHLY editors, Roger Horn and Dan 
Velleman, for extensive comments on an earlier draft of this paper. 


OTHER REFERENCES 


77. EF. Aragon, D. H. Bailey, J. M. Borwein, P. B. Borwein, Walking on real numbers. Math. 
Intelligencer. 35 no. 1 (2013) 42-60. DOI: http://link.springer.com/article/10.1007 
%2Fs00283-012-9340-x. 

78. J. Arndt, C. Haenel, Pi Unleashed, Springer Science & Business Media, Berlin, 2001. 

79. D. Blatner, The Joy of Pi, Walker/Bloomsbury, US, 1997. See also http://www. joyofpi.com/ 
thebook. html. 

80. L. Berggren, J. M. Borwein, P. B. Borwein, Pi: A Source Book, Springer-Verlag, (1997), ISBN: 0-387- 
94924-0. Second edition, 2000, ISBN: 0-387-94946-3. Third edition, incorporating ’*A Pamphlet on Pi,” 
2004. 


'6From “The Wittgenstein Controversy,” by Evelyn Toynton in the Atlantic Monthly, June 1997, pp. 28-41. 


212 © THE MATHEMATICAL ASSOCIATION OF AMERICA [Monthly 122 


25. I PREFER PI: A BRIEF HISTORY OF ARTICLES IN THE MONTHLY (2015) 495 


81. J. M. Borwein, The Life of Pi.’ Extended and updated version of “La vita di pi greco,” volume 2 of 
Mathematics and Culture, La matematica: Problemi e teoremi, Guilio Einaudi Editori, Turino, Italian, 
2008. 

82. J.M. Borwein, P. B. Borwein, Pi and the AGM, John Wiley and Sons, New York, 1987. 

83. J. M. Borwein, A. van der Poorten, J. Shallit, W. Zudilin, Neverending Fractions, Australia Mathematical 
Society Lecture Series, Cambridge Univ. Press, Cambridge, UK, 2014. 

84. D. Dalzell, On 22/7, J. London Math. Soc. 19(1944) 133-134. 

85. K. Mahler, On the approximation of 2, Nederl. Akad. Wetensch. Proc. Ser. A. 56 = Indagationes Math. 
15(1953) 30-42. 

86. G.N. Watson, The final problem: An account of the mock theta functions J. London Math. Soc. 11(1936) 
55-80. 


JONATHAN M. BORWEIN is Laureate Professor in the School of Mathematical and Physical Sciences and 
director of the Priority Research Centre in Computer Assisted Research Mathematics and Its Applications at 
the University of Newcastle. An ISI highly cited scientist and former Chauvenet prize winner, he has published 
widely in various fields of mathematics. His most recent books are Convex Functions (with John Vanderwerff, 
vol 109, Encyclopedia of Mathematics, Cambridge Univ. Press, 2010), Modern Mathematical Computation 
with Maple (with Matt Skerritt, Springer Undergraduate Mathematics and Technology, 2011), Lattice Sums 
Then and Now (with Glasser, McPhedran, Wan and Zucker, vol 150, Encyclopedia of Mathematics, Cambridge 
Univ. Press, 2013), and Neverending Fractions with the late Alf van der Poorten, Jeff Shallit, and Wadim 
Zudilin (Australian Mathematical Society Lecture Series, Cambridge Univ. Press, 2014). 

Division of Mathematics, School of Mathematical and Physical Sciences, University of Newcastle, NSW 2308, 
Australia. 

jonathan.borwein@ newcastle.edu.au 


SCOTT T. CHAPMAN received a bachelors degree in mathematics and political science from Wake Forest 
University, a masters degree in mathematics from the University of North Carolina at Chapel Hill, and a Ph.D. 
in mathematics from the University of North Texas. Since 2008, he has held the titles of professor and Scholar 
in Residence at Sam Houston State University in Huntsville, Texas. In 2012, he became editor of the American 
Mathematical Monthly, and his term runs through 2016. 

Sam Houston State University, Department of Mathematics and Statistics, Box 2206, Huntsville, 

TX 77341-2206, USA. 

scott.chapman@ shsu.edu 


A. APPENDIX: I. NIVEN - THE TRANSCENDENCE OF z [14]. Among the 
proofs of the transcendence of e, which are in general variations and simplifications 
of the original proof of Hermite, perhaps the simplest is that of A. Hurwitz.!’ His 
solution of the problem contains an ingenious device, which we now employ to prove 
the transcendence of z. 

We assume that z is an algebraic number, and show that this leads to a contradiction. 
Since the product of two algebraic numbers is an algebraic number, the quantity iz is 
a root of an algebraic equation with integral coefficients 


6,(x) = 0, (30) 
whose roots are @; = i77, 2, 03, ..., @,. Using Euler’s relation e’” + 1 = 0, we have 
(e“! + 1) (e +.1)---(e" +1) =0. (31) 


!7\. Hurwitz, Beweis der Transendenz der Zahl e, Mathematische Annalen, vol. 43, 1893, pp. 220-221 (also 
in his Mathematische Werke, vol. 2, pp. 134-135). 
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We now construct an algebraic equation with integral coefficients whose roots are 
the exponents in the expansion of (31). First consider the exponents 


Oy +O, A +3, U2 +3, ..., Api + An. (32) 
By equation (30), the elementary symmetric functions of a, @2,...,@, are rational 
numbers. Hence the elementary symmetric functions of the quantities (32) are rational 
numbers. It follows that the quantities (32) are roots of 


43(x) = 0, (33) 


an algebraic equation with integral coefficients. Similarily, the sums of the @’s taken 
three at a time are the ,,C3 roots of 


03(x) = 0. (34) 


Proceeding in the same way, we obtain 


O4(x) = 0, O5(x) = 0,..., An(x) = 0, (35) 
algebraic equations with integral coefficients, whose roots are the sums of the w’s taken 
4,5,...,n at atime respectively. The product equation 

9; («)62(x) ++ +O, (x) = 0, (36) 


has roots that are precisely the exponents in the expansion of (31). 
The deletion of zero roots (if any) from equation (36) gives 


O(x) = cx" ex" | +--- +c, =0, (37) 


whose roots 6, 62,..., 8, are the non-vanishing exponents in the expansion of (31), 
and whose coefficients are integers. Hence (31) may be written in the form 


ei + eh24...4e% +k =0, (38) 


where k is a positive integer. 
We define 


_ xP {O(x)}? 
f(Qx)= “Get (39) 


where s = rp — 1, and p is a prime to be specified. Also, we define 
F(x) = f(x) + FOG) + FOR) +.--+ FSP), (40) 


noting, with thanks to Hurwitz, that the derivative of e~* F(x) is —e~* f (x). Hence we 
may write 


e F(x) — & F(0) = - —e* f (E)dE. 
0 
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The substitution € = tx produces 


1 
F(x) — e*F(0) = -x f el-* F(rx)dt. 
0 


Let x range over the values 6,, 62, ..., 6, and add the resulting equations. Using (38) 
we obtain 
r r 1 
\> F(B)) +kFO) =—>- B; / e"-Pi f (TB, )dt. (41) 
j=l j=l 9 


This result gives the contradiction we desire. For we shall choose the prime p to make 
the left side a non-zero integer, and make the right side as small as we please. 
By (39), we have 


r 


y 7" =6, for O<t<p. 


j=l 


Also by (39) the polynomial obtained by multiplying f(x) by (p — 1)! has integral 
coefficients. Since the product of p consecutive positive integers is divisible by p!, the 
pth and higher derivatives of (p — 1)! f(x) are polynomials in x with integral coeffi- 
cients divisible by p!. Hence the pth and higher derivatives of f(x) are polynomials 
with integral coefficients, each of which is divisible by p. That each of these coeffi- 
cients is also divisible by c* is obvious from the definition (39). Thus we have shown 
that, for t > p, the quantity f(8;) is a polynomial in ; of degree at most s, each of 
whose coefficients is divisible by pc*. By (37), a symmetric function of 6,, B2,..., B, 
with integral coefficients and of degree at most s is an integer, provided that each 
coefficient is divisible by c’ (by the fundamental theorem on symmetric functions). 
Hence 


r 


Df B) = phe, (t=p,ptl,...,pt+s) 


j=l 
where the k, are integers. It follows that 


n+s 


> FB) = Pk 
j=l '=p 


In order to complete the proof that the left side of (41) is a non-zero integer, we now 
show that k F (0) is an integer that is prime to p. From (39) it is clear that 


f(O) = 0, (t=0,1,..., p — 2) 
pie a 
f°O=PK,, (=p, ptl,...,p+s) 


where the K, are integers. If p is chosen greater than each of k, c, c, (possible since 
the number of primes is infinite), the desired result follows from (40). 


March 2015] PI IN THE MONTHLY 215 


498 25. I PREFER PI: A BRIEF HISTORY OF ARTICLES IN THE MONTHLY (2015) 


Finally, the right side of (41) equals 


> a ek eli dr, 
c Jo (p — 1)! 


This is a finite sum, each term of which may be made as small as we wish by choosing 
p very large, because 


fig OE 
pos (p—D! 
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I Prefer Pi: Addenda 


Jonathan Borwein and Scott Chapman 


Abstract. In the rush to prepare our March 2015 article on Pi [1], several infelicities escaped 
our eye. Herein we repair the damage. 


1. MATHEMATICAL CORRIGENDA. Lord Brouckner’s continued fraction given 
in [1, (4)], and mentioned in the text above, should have been 


4 fe 2 se Fe ae AS 


Las 2 2 i i 


2. BIBLIOGRAPHIC CORRIGENDA. 
¢ References [33] and [74] omit the names of co-authors. The references should read 
as follows. 


33. R. Guy and Y. Matiyasevich, A new formula for 2, Amer. Math. Monthly 93 
(1986) 631-635. 


74. J. Sondow and H. Yi, New Wallis- and Catalan-type infinite products for pi, e, 
and ./(2 + ./(2)), Amer. Math. Monthly 117 (2010) 912-917. 


¢ The following paper, with 33 Google ciatations, was omitted from the regular bibli- 
ography and the special bibliography on p. 196. We offer our apologies to Professor 
Osler. 


T. J. Osler, The united Vieta’s and Wallis’s products for pi, Amer. Math. Monthly 
106 (1999) 774-776. 


ACKNOWLEDGMENT. The authors wish to thank the readers who drew our attention to these lacunae. 
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Mathematical Monthly, Amer. Math. Monthly 122 (2015) 195-216. 
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